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FLUORESCENT DYE BINDING PEPTIDFS 

This application is a continuation-in-part of application Serial No. 60/104.465 filed October 16, 
1998. 

FIELD OF THE INVENTION 

5 The invention relates to peptides that bind to fluorescent dyes, termed "fluorettes", and to methods 
of making and using the fluorettes. In particular, the fluorettes can be used in detection and assay 
systems in vitro and in vivo. 

BACKGROUND OF THE INVENTION 

Fiuorophore dyes, due to their exquisite sensitivity and ease of use. are widely used in numerous 
1 0 approaches in fluorescent microscopy, flow cytometry and other detection systems (Haugland. 
Handbook of fluorescent probes and research chemicals (sixth edition). Molecular Probes, Inc.. 
Eugene OR (1996) and ref. therein). 

Detection of proteins in living cells using fluorescence approaches has been accomplished in a 
variety of settings. For instance, it is possible to use ligands (or naturally-derived antibodies) 

1 5 conjugated directly or indirectly to fluorophores as probes of the expression levels of nearly any 
given surface-expressed protein on living cells. In some particular cases for proteins within cells, 
it is possible to use permeable ligands for individual target proteins. In these cases the ligand is 
either self-fluorescent, becomes fluorescent upon binding or is conjugated to fluorescent adducts. 
In other cases, it has been possible for many years to genetically fuse reporter enzymes such as 

2 0 p-galactosidase, P-glucuronidase, and P-glucosidase to proteins and use a fluorogenic dye. acted 
upon by the reporter enzyme(s). to assay enzymatic activity on a cell by cell basis (Nolan etal. 
Proc. Natl. Acad. Sci. U.S.A. 85:2603-2607 (1988); Lorincz et aL Cytometry 24:321-329 (1996); 
Krasnow ef a/. Science. 251:81-85 (1991). Other systems, including p-lactamase (ZIokarnik et 
aL Science 279:84-88(1998)) build upon those findings by applying dyes with increased cell 

25 permeability or having radiometric fluorescent qualities that might have advantages in some 
applications. In recent years proteins with inherent fluorescence, such as Green Fluorescent 
Protein (Welsh and Kay. Curr. Opin. Biotechnol. 8:617-622 (1997); Misteli and Specter. Nat. 
Biotechnol. 15:961-964 (1997))-have become widespread in their application owing to ease of 
use. the availability of mutant proteins with differing spectral qualities in either excitation or 



emission, and the relative non-toxicity of the approach. 

However, in the aforementioned cases the approaches are limited by a need to genetically fuse a 
relatively bulky reporter protein to the molecule under study. This can have detrimental 
consequences to the functionality of the protein in question or interfere mechanistically with 
5 cellular constituents with which the protein interacts. While it would be best to directly measure a 
given target protein using a specific fluorescent dye that recognized any given target moiety, no 
technology exists as yet to create such reagents. 

There Is a need, therefore, to develop approaches that provide the building blocks for specific 
biomaterial detection. 

10 SUMMARY OF THE INVENTION 

The invention provides peptides that bind to fluorophore dyes. In one aspect of the invention the 
peptides are made of naturally occuring amino acids, non-naturally occurring amino acids, or 
combinations thereof. 

In another aspect of the invention, methods are provided for isolating and identifying peptides that 
15 bind to fluorophore dyes. The method comprises creating and screening peptide libraries that bind 
to fluorophores. 

In another aspect of the invention, methods are provided for increasing the binding affinity of the 
fluorette for a fluorophore. 

In a further aspect of the invention are provided complexes of fluorettes bound to fluorophore 
2 0 dyes. The binding of the fluorophore by the fluorette may alter the excitation and/or the emission 
spectrum of the fluorophore. 

In an additional aspect, the present invention provides methods for detecting a fluorette by binding 
a fluorette to a fluorophore dye and detecting the fluorette/fluorophore dye complex. 

BRIEF DESCRIPTION OF THE DRA\A/INGS 
25 Figure 1. Fluorophore dye carriers. Fluorescein. Oregon Green 514, Rhodamine Red and Texas 
Red activated derivatives were covalently linked to the polymer carrier Ultralink Immobilized 
DADPA via a 12-atom diaminodipropylamine spacer. A spacer is shown as a horizontal thick bar. 
Digits with arrows in the chemical structures of all fluorophore dyes, except Oregon Green 514, 
show that "mixed isomers" of these fluorophore dye activated derivatives were used for a coupling. 
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Figure 2. Blopanning with a peptide phage display library against Fluorescein. Oregon Green 514, 
Rhodamine Red and Texas Red carriers. A Ph.D,-12 phage display peptide library (New England 
Biolabs) is a combinatorial library of 12-mer peptides fused to the amino-terminus of a minor coat 
pill protein of the bacteriophage M13mp19. Bound phage in four biopanning rounds were 
5 calculated as a percentage of eluted phage plaque forming unit (plaque forming unit (pfu)) yield vs 
input phage plaque forming unit (pfu). All experiments were performed in duplicate. Standard 
deviations did not exceed 19%. 

Figure 3. Phage-fluorophore dye binding. Fluorophore dye-specific OG403, OG402, RhR401 and 
TR401 phage were bound to Oregon Green 514, Fluorescein, Rhodamine Red and Texas Red 

10 "free" dyes, respectively (see text). The normalized particle amounts of amplified Ph.D.-12 phage 
display peptide library (nonspecific phage) were bound in solution with the same dyes as controls. 
Phage-fluorophore dye complexes were precipitated three times by PEG to remove unbound dye 
and spotted to a nitrocellulose filter. The filter was scanned on the Storm 840 scanner (Molecular 
Dynamics). The presented data is from a single scan and image enhancements were done 

1 5 simultaneously on the complete set of dye binding results using NIH Image 1 .59 software. 

Specific/nonspecific signal ratios were quantified by densitometry of spot images using NIH Image 
1.59 software (see Materials and Methods for details). All binding experiments shown were 
completed in duplicate. Densitometry was carried out prior to contrast enhancement. Brightness 
and contrast values were modified to 50 and 65. respectively in Adobe Photoshop. Individual 

2 0 elements of the figure were arranged as a composite from the same original scan from which the 

binding ratios were determined. 

Figure 4. Biopanning with four biased combinatorial phage display peptide libraries based on 
degenerated fluorettes against Fluorescein, Oregon Green 514, Rhodamine Red and Texas Red 
carriers. OG402-91CL, OG403-91CL. RhR401-91CL and TR401-91CL biased combinatorial 
25 phage display peptide libraries (see text) were subjected to a biopanning against Fluorescein. 
Oregon Green 514, Rhodamine Red and Texas Red carriers, respectively. Bound phage in three 
biopanning rounds were calculated as described in a legend to Figure 2. All experiments were 
performed in duplicate. Standard deviations did not exceed 21%. 

Figure 5A and 5B. Peptide - Texas Red binding. Excitation and emission spectra. Peptides at the 

3 0 noted concentrations were incubated with Texas Red (50nM). or without, in 0.6ml of TBS for 1 hr at 

room temperature (RT). Samples were scanned on a spectrofluorimeter (SPEX Fluoromax (Jobin 
Yvon-SPEX Instruments Co.)). Peak excitation or emission positions are shown in the figure using 
vertical bars. Excitation and excitation/emission peak shifts for PepTR401(10pM)/Texas Red and 
PepTRS311 (10MM)/Texas Red complexes, respectively (A), were reproducible in three 
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independent experiments. The results of a single experiment are presented. 

Figure 6. Texas Red-specific peptide PepTR401 (Panel A). PepTRS311 (Panel B). PepTR406 
(Panel C) and nonspecific PepControl (Panel D) (see peptide sequences In Table 4) were bound 
via Hisg tag to cobalt ion-containing Sepharose beads. The peptide coated beats were washed 
5 and incubated with 0.5 micromolar Texas Red in TBS for 1 hour at room temperature followed by 
several washings of beads in order to remove unbound dye. Fluorescent and nonflourescent 
control beads were photographed on fluorescent microscope Axlophot (Zeiss) using Rhodamine 
Red/Texas Red filter with the same time exposure for every sample. All binding experiments were 
accomplished in three parallels and results of a single experiments are shown. 

10 Figure 7A-D . Peptide - Texas Red binding. Texas Red-specific peptides TR401 (linear) (Figure 
6B). TRP501 (SKVILFE-flanked) (Figure 6C), TRP512 (SKVILFE-flanked) (Figure 6D). and 
nonspecific peptide (SKVILFE-flanked) (Figure 6A) as a negative control were bound via 
polyhistidine (HiSg) tag to cobalt ion-containing Sepharose beads. The peptlde-coated beads were 
washed and incubated with 0.5 pM Texas Red in TBS buffer for 1 hour at room temperature 

1 5 followed by several washings of beads in order to remove unbound dye. Fluorescent and 
nonfluorescent control beads were photographed on fluorescent microscope Axiophot (Zeiss) 
using Rhodamine Red/Texas Red filter with the same time exposure for every sample. All binding 
experiments were accomplished in duplicate and results of a single experiment are shown. 

DETAILED DESCRIPTION OF THE INVENTION 

2 0 The present invention provides novel peptides or "fluorettes" that specifically bind fluorophore 
dyes. The present invention also provides novel compositions comprising multimeric fluorettes 
and fluorettes fused or linked to other compounds or molecules. Also provided are methods to 
create and modify fluorettes. 

The present invention further provides methods for the use of fluorettes in detecting biological 

2 5 materials, molecules, or target analytes. intracellular events, and intermolecular and intramolecular 

interactions. The fluorettes also find use in in vitro assays and high throughput screens. The 
methods provided are based on the formation and detection of a fluorette/fluorophore dye 
complex. 

The compositions and methods of the present invention provide a significant improvement over 

3 0 conventional light-emitting techniques. Fluorettes. due to their small size, are not intrusive to the 
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systems being studied and. therefore, permit detection and analysis of a target moiety or molecule 
while minimizing target modification. Current tight-emitting detection methods involve the use of 
enzyme-generated fluorochromes. luciferase-generated light, or reporter systems using 
engineered, inherently fluorescent proteins such as Aequoria victoria green fluorescent protein 
5 (GFP). These systems are unwieldy and bulky and are intrusive to the system being studied; 
often necessitating the synthesis of large fusion proteins or out of context genomic constructions. 
Thus, conventional techniques can involve substantial genetic manipulations of the targets one is 
trying to study, which may disrupt, interfere, or alter the process being measured. Fluorettes 
minimize or avoid these limitations. 

10 Accordingly, the present invention provides a peptide, sometimes termed a "fluorette" herein, that 
will bind a fluorescent dye. 

By "peptide" or "fluorette" herein is meant at least about 8 covalently attached amino acids. The 
peptide may be made up of naturally occurring amino acids and peptide bonds, or synthetic 
peptidomimetic structures. Thus "amino acid", or "peptide residue", as used herein means both 
15 naturally occurring and synthetic amino acids. For example, homo-phenylalanine. citrulline and 
noreleucine are considered amino acids for the purposes of the invention. "Amino acid" also 
includes imino acid residues such as proline and hydroxyproline. The side chains may be in 
either the (R) or the (S) configuration. In the preferred embodiment, the amino acids are in the (S) 
or L-configuration. If non-naturally occurring side chains are used, non-amino acid substituents 

2 0 may be used, for example to prevent or retard in vivo degradations. The peptides can be linear or 

branched. Amino acids includes naturally occurring amino acids or non-naturally occurring amino 
acids. 

By "naturally-occurring amino acids" herein is meant amino acids that are produced by living 
organisms. Naturally-occurring amino acids are preferred in in vivo embodiments. 

25 By "non-naturally-occurring amino acid's" herein is meant amino acids not produced by a living ^ 
organism but that can be chemically synthesized. Non-naturally-occurring amino acids include, for 
example, stereoisomers or enantiomers of the naturally-occurring amino acids in which the amino 
side chains, are in either the (D)((R)) or (S)((L)) configuration; amino acids in which an amine 
group is bonded to any other but the alpha-carbon or may have more than one amine group 

3 0 bonded to the alpha-carbon; and amino acids having functional groups or "R" groups not known to 

occur in nature. 
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The length of the peptide or fluorette, i.e., the number of amino acids, will vary. In general, the 
number of amino acids varies from about 8 to about 50. with from about 8 to about 40 being 
preferred, from about 8 to about 25 being particularly preferred, and from about 8 to about 12 
being especially preferred. Thus, "peptide" includes peptides, oligopeptides, and in some cases, 
5 proteins. 

The fluorette can be monomeric or multimeric. That is, a monomeric fluorette binds a single 
fluorophore. A multimeric fluorette is two or more associated monomeric fluorettes. The 
association of a multimeric fluorette can be either covalent or non-covalent. For example, the 
monomers can be joined directly together, for example as a linear fusion (i.e., the carboxy 
10 terminus of one is joined to the amino terminus of the second), or as a branched fusion, wherein 
the attachment of the second monomer is other than to the backbone of the first. In addition, a 
chemical cross-linker or inclusion of specific reactive group on a fluorette be used to join fluorette 
monomers. Alternatively the monomers may be non-covalently associated; for example, as is 
more fully outlined below, dimerization sequences can be used to associate two monomers. 

15 A multimeric fluorette can be homomultimeric, i.e., all the monomers are the same, and bind the 
same fluorophore dye, or heteromultimeric, having at least two different fluorette monomers and 
can bind different fluorophores. 

The fluorette peptides bind to at least one fluorophore dye. 

By "binding" herein generally is meant a non-covalent association or interaction between a 
2 0 fluorette and a fluorescent dye. The non-covalent interactions between the fluorette and dye may 
involve various types of electrostatic, hydrophilic. and hydrophobic interactions. Binding may also 
involve forming one or more covalent bonds between the fluorette and the dye. Covalent bonds 
can be formed directly between the fluorette and the dye or can be formed by a cross linker or by 
inclusion of a specific reactive group on either the fluorette or dye or both molecules. Binding may 

2 5 also involve a combination of covalent and non-covalent interactions. 

In a preferred embodiment, the binding is specific. By "specific binding" herein is meant that the 
fluorophores will preferentially bind to a fluorophore dye with a binding constant in the range of at 
least about 10-^ M'^ to about 10"'° M-\ with a preferred range being from at least about 10^ M'^ to 
about 10 ' M-\ with an especially preferred range of from at least about 10'^ M'^ to about 10"^° M'\ 

3 0 In a preferred embodiment, the fluorettes do not specifically bind to other fluorophore dyes or 

compounds or moieties; that is. a fluorette is specific to one fluorophore. Alternatively, a fluorette 
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may bind two or more fluorescent dyes (bind together 2 or more fluorophores or more or capable 
of binding independently two or more fluorophores); that is. a fluorette may specifically bind 
another compound or moiety, including another fluorophore, If these molecules have a common 
structural feature(s) that specifically interact with the fluorette. Preferably, however, a fluorette will 
5 not appreciably bind compounds other than fluorescent dyes. 

In other embodiments, a dye is non-fluorescent but becomes fluorescent when bound by a 
fluorette. In further embodiments, fluorette binding to a non-fluorescent dye causes the release of 
a fluorescent moiety. When the fluorescent moiety is released, the fluorette may bind to the 
released fluorescent moiety, may bind to the non-fluorescent dye, or may be released and 
10 therefore free to bind a second molecule of the non-fluorescent dye and repeat this process. 

By "fluorescent dye" or "fluorophore" or "fluorophore dye" herein is meant a compound that 
absorbs an incident light of a characteristic range of wavelengths or excitation spectrum and 
dissipates the absorbed energy by emitting light of a characteristic range of wavelengths or 
emission spectrum. By "excitation spectrum" herein is meant the wavelength of incident light 
1 5 absorbed by the fluorophore dye that causes the fluorophore dye to fluoresce. By "emission 
spectrum" herein is meant the characteristic wavelengths of the emitted or fluorescent light 
produced as the energy of the absorbed incident light is released. The excitation and emission 
spectra for a fluorophore dye may or may not overlap. In a preferred embodiment the incident 
light is in the ultraviolet spectrum and the emitted light is in the visible spectrum. 

2 0 Preferred fluorophores include, but are not limited to, Texas Red, Rhodamine Red. Oregon Green 
514. and Fluorescein. Examples of fluorescent dyes are found in the Molecular Probes Catalog. 
6th Ed., Richard Haugland. Ed., which is expressly incorporated by reference in its entirety. 

In addition, the fluorette can further comprise additional components, such as a fusion partner or 
functional group. By "fusion partner" or "functional group" herein is meant a sequence that is 

2 5 associated with the fluorettes. that confers upon the fluorette a function or ability. Fusion partners 

can be heterologous (i.e. not native to a host cell), or synthetic (not native to any cell). Suitable 
fusion partners include, but are not limited to: a) presentation structures, as defined below, which 
provide the fluorettes in a conformatlonally restricted or stable form; b) targeting sequences, 
defined below, which allow the localization of the flourette into a subcellular or extracellular 

3 0 compartment: c) rescue sequences as defined below, which allows the purification or isolation of 

the fluorette; d) stability sequences, which confer stability or protection from degradation to the 
fluorette. for example resistance to proteolytic degradation; e) dimerization sequences, to allow for 
fluorette dimerization or multimerization; or f) any combination of a), b). c). d), and e). as well as 



linker sequences as needed. 

In a preferred embodiment, the fusion partner is a presentation structure. By "presentation 
structure" or grammatical equivalents herein is meant a sequence, which, when fused to the 
fluorette, causes the fluorette to assume a conformatlonally restricted form. Proteins interact with 
5 other proteins and molecules largely through conformatlonally constrained domains. Although 
small peptides with freely rotating amino- and carboxy-termini can have potent functions as is 
known in the art. the conversion of such peptide structures into active agents can be difficult due 
to the inability to predict side-chain positions for peptidomimetic synthesis. Therefore the 
presentation of fluorettes in conformatlonally constrained structures will likely lead to higher affinity 
10 interactions of the fluorette with its target fluorophore dye. This fact has been recognized in the 
combinatorial library generation systems using biologically generated short peptides in bacterial 
phage systems. A number of workers have constructed small domain molecules in which one 
might present biased-combinatorial libraries of peptide structures. 

While the fluorettes are peptides, presentation structures are preferably peptides or proteins. 
15 Thus, synthetic presentation structures, i.e. artificial polypeptides, are capable of presenting 
fluorettes as a conformationally-restricted domain. Generally such presentation structures 
comprise a first portion joined to the N-terminal end of the flourette. and a second portion joined to 
the C-terminal end of the flourette ; that is. the flourette is inserted into the presentation structure, 
although variations may be made, as outlined below. To increase the affinity or specificity of the 

2 0 fluorette, the presentation structures are selected or designed to have minimal biologically activity 

when expressed in the target cell. 

Preferred presentation structures maximize accessibility to the flourette by presenting it on an 
exterior loop. Accordingly, suitable presentation structures include, but are not limited to. minibody 
structures, loops on beta-sheet turns and coiled-coil stem structures in which residues not critical 
25 to structure are randomized, zinc-finger domains, cysteine-linked (disulfide) structures, 

transglutaminase linked structures, cyclic peptides. B-loop structures, helical barrels or bundles, 
leucine zipper motifs, etc. In addition, presentation structures include dimerization sequences, as 
defined below. 

In a preferred embodiment, the presentation structure is a coiled-coil structure, allowing the 

3 0 presentation of the flourette on an exterior loop. (See. for example. Myszka et al.. Biochem. 

33:2362-2373 (1994). hereby incorporated by reference). Using this system investigators have 
isolated peptides capable of high affinity interaction with the appropriate target. In general, coiled- 
coil structures allow for between 6 to 20 amino acids. 
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A preferred coiled-coil presentation structure is as follows: 

MGCAALESEVSALESEVASLESEVAALXt. ,LAAVKSKLSAVKSKLASVKSKLAA CGPP (SEQ ID 
NO:49). The underlined regions represent a coiled-coil leucine zipper region defined previously 
(see Martin et a!., EMBO J. 13(22):5303-5309 (1994), incorporated by reference). The bolded X^n, 
5 region represents the loop structure and when appropriately replaced with peptides (i.e. fluorettes , 
generally depicted herein as (X)^, where X is an amino acid residue and n is an integer of at least 
5 or 6) can be of variable length. The replacement of the bolded region is facilitated by encoding 
restriction endonuclease sites in the underlined regions, which allows the direct incorporation of 
oligonucleotides at these positions. For example, a preferred embodiment generates an Xhol site 
10 at the double underlined LE site and a Hindlll site at the double-underlined KL site. 

In a preferred embodiment, the presentation structure is a minibody structure. A "minibody" Is 
essentially composed of a minimal antibody complementarity region. The minibody presentation 
structure generally provides two randomizing regions that in the folded protein are presented along 
a single face of the tertiary structure. (See for example Bianchi et al., J. MoL Biol. 236(2):649-59 
15 (1994). and references cited therein, all of which are incorporated by reference). Investigators 
have shown this minimal domain is stable in solution and have used phage selection systems in 
combinatorial libraries to select minibodies with peptide regions exhibiting high affinity, Kd = 10"^, 
for the pro-inflammatory cytokine [L-6. 

A preferred minibody presentation structure is as follows: 
2 0 MGRNSQATSGFIFSHFYMEWVRGGEYIAASR HKHNKYT TEYSASVKGRYIVSRDTSQc^n Yl nK 
KKGPP (SEQ ID NO:50). The bold, underline regions are the regions which may be contain 
fluorette sequences. The italicized phenylalanine must be invariant in the first randomizing region. 
The entire peptide is cloned in a three-oligonucleotide variation of the coiled-coil embodiment, thus 
allowing two different randomizing regions to be incorporated simultaneously. This embodiment 

2 5 utilizes non-palindromic BstXI sites on the termini. 

In a preferred embodiment, the presentation structure is a sequence that contains generally two 
cysteine residues, such that a disulfide bond may be formed, resulting in a conformationally 
constrained sequence. This embodiment is particularly preferred when fluorettes are to be 
secreted from a cell. As will be appreciated by those in the art. any number of flourettes. with or 

3 0 without spacer or linking sequences, may be flanked with cysteine residues. In other 

embodiments, effective presentation structures may be generated by the flourettes themselves. 
For example, the flourettes may be "doped" with cysteine residues which, under the appropriate 
redox conditions, may result in highly crosslinked structured conformations, similar to a 
presentation structure. Similarly, the flourettes may be controlled to contain a certain number of 
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residues to confer (l-sheet or a-helical structures. 

In a preferred embodiment, a presentation structure is neuropeptide head activator. Bodenmuller 
et al EMBO J. 5(8), 1825-1829 (1983) showed that neuropeptide head activator (HAv) dimerized 
to yield a biologically inactive form of the peptide at concentration as low as 10-^^M, thus indicating 
5 extremely high self-binding affinity. Aldwin et al. U.S. Patent No. 5,491,074 observed that a 

fragment containing the last six amino acids of this peptide's carboxyl terminus (SKVILF) (SEQ ID 
NO:51) resulted in dimers that were even more stable than the HA itself. The USPN 5,491 .074 
shows that the last amino acid of SKVILF, the F (phenylalanine), must be on the carboxyl terminus 
for a proper peptide dimerization activity. However, if F is not on the carboxyl terminus, it must be 

10 followed by one of two amino acids with free carboxyl group, E or D, to maintain the peptide 

dimerization. Therefore, if the dimerizing peptide positioned inside a carrier protein, 6-mer SKVILF 
sequence must be converted to 7-mer SKVILFE (SEQ ID NO:51) or SKVILFD (SEQ ID NO:53) 
sequences. For example, a presentation structure comprising neuropeptide head activator 
peptides comprises: SKVILFE/D(X)„SKVILFE/D (SEQ ID NO:1 12); wherein X represents any 

15 amino acid, and n is at least about 8 amino acids. Linkers or spacers as known in the art are 

optionally placed between the HAv peptides and the variable ((X)J region and optionally placed at 
the amino and carboxy terminus of the structure. 

Positioning of peptides of between two SKVILFE(D) sequences makes the peptides constrained. 
Upon SKVILFE(D) -SKVILFE(D) intramolecular interaction, the peptide structure is converted from 

2 0 linear to a loop or mini-domain such that the* peptide becomes structurally constrained. This 
provides three important advantages. First, peptides in constrained conformations usually have 
higher affinity for a ligand than the same peptide in linear conformation. Second, the constrained 
peptide has a higher protease resistance than the linear peptide as a result of the formation of a 
"rigid surface" structure. Third, and more importantly, transfer of a constrained peptide as a 

25 cassette to a protein carrier will not dramatically change its original conformation (and 

consequently its binding activity) while a linear peptide is more likely to show a decrease in binding 
activity. 

In a preferred embodiment, the fusion partner is a targeting sequence. As will be appreciated by 
those in the art, the localization of proteins within a cell is a simple method for increasing effective 
30 concentration. The localization of a peptide to a specific cellular location, such as a membrane, 
limits its search for its ligand to that limited dimensional space. Alternatively, the concentration of 
a protein or polypeptide can also be simply increased by nature of the localization. For example, 
shuttling the fluorettes into the nucleus confines them to a smaller space thereby increasing the 
concentration. Finally, the ligand or target may simply be localized to a specific compartment. As 

10 



will be appreciated by those in the art. when a fluorette is fused to another protein, as outlined 
herein, the targeting sequence can be fused to the protein partner as well. 

Thus, suitable targeting sequences include, but are not limited to. binding sequences capable of 
causing binding of the fluorette to a predetermined molecule or class of molecules while retaining 
5 activity of the fluorette, (for example by using enzyme inhibitor or substrate sequences to target a 
class of relevant enzymes): sequences signalling selective degradation, of itself or co-bound 
proteins; and signal sequences capable of constitutively localizing the fluorette to a predetermined 
cellular locale, including a) subcellular locations such as the Golgi, endoplasmic reticulum, 
nucleus, nucleoli, nuclear membrane, mitochondria, chloroplast, secretory vesicles, lysosome, and 
10 cellular membrane; and b) extracellular locations via a secretory signal. Particularly preferred is 
localization to either subcellular locations or to the outside of the cell via secretion. 



In a preferred embodiment, the targeting sequence is a nuclear localization signaP(NLS). NLSs 
are generally short, positively charged (basic) domains that serve to direct the entire protein in 
which they occur to the cell's nucleus. Numerous NLS amino acid sequences have been reported 

15 including single basic NLS's such as that of the SV40 (monkey virus) large T Antigen (Pro Lys Lys 
Lys Arg Lys Val) (SEQ ID NO:54). Kalderon (1984), et al.. Cell, 39:499-509; the human retinoic 
acid receptor-(i nuclear localization signal (ARRRRP) (SEQ ID NO:55); NFkB p50 
(EEVQRKRQKL SEQ ID NO:56); Ghosh et al.. Cell 62:1019 (1990); NFkB p65 (EEKRKRTYE 
(SEQ ID NO:57; Nolan et al.. Cell 64:961 (1991); and others (see for example Boulikas, J. Cell. 

20 Biochem. 55(1):32-58 (1994), hereby incorporated by reference) and double basic NLS's 

exemplified by that of the Xenopus (African clawed toad) protein, nucleoplasmin (Ala Val Lys Arg 
Pro Ala Ala Thr Lys Lys Ala Gly Gin Ala Lys Lys Lys Lys Leu Asp) (SEQ ID NO:58). Dingwall, et 
aL. Cell, 30:449-458. 1982 and Dingwall. etaL. J. Cell Biol.. 107:641-849; 1988). Numerous 
localization studies have demonstrated that NLSs incorporated into peptides or grafted onto 

25 proteins not normally targeted to the cell nucleus cause these peptides and proteins to be 

concentrated in the nucleus. See. for example. Dingwall, and Laskey. Ann. Rev. Cell Biol.. 2:367- 
390, 1986; Bonnerot. et al.. Proc. Natl. Acad. Sci. USA, 84:6795-6799. 1987; Galileo, et al., Proc. 
Natl. Acad. Sci. USA. 87:458-462. 1990. 

In a preferred embodiment, the targeting sequence is a membrane anchoring signal sequence. 
3 0 This is particularly useful since many parasites and pathogens bind to the membrane, in addition 
to the fact that many intracellular events originate at the plasma membrane. Thus, membrane- 
bound fluorettes are useful for both the identification of important elements in these processes as 
well as for the discovery of effective inhibitors. The invention provides methods for presenting the 
fluorettes extracellularly or in the cytoplasmic space. For extracellular presentation, a membrane 
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anchoring region is provided at the carboxy! terminus of the peptide presentation structure. In 
another embodiment, for extracellular presentation, a membrane anchoring region is provided at 
the amino terminus of the peptide presentation structure. Preferably, the amino terminal 
anchoring region functions as the anchoring region of a Type 2 glycoprotein. The fluorette is 
5 expressed on the cell surface and presented to the extracellular space, such that it can bind its 
target fluorophore. The binding of a fluorette, espescially when fused to a polypeptide or protein 
could confer a function on the cells expressing the fluorette. The cytoplasmic region could be 
neutral or could contain a domain that, when the fluorette is bound, confers a function on the cells 
(activation of a kinase, phosphatase, binding of other cellular components to effect function). 
10 Similarly, the fluorette could be contained within a cytoplasmic region, and the transmembrane 
region and extracellular region remain constant or have a deflned function. 

Membrane-anchoring sequences are well known in the art and are based on the geometry of 
mammalian transmembrane molecules. Peptides are inserted into the membrane based on a 
signal sequence (designated herein as ssTM) and require a hydrophobic transmembrane domain 
15 (herein TM). The transmembrane proteins are inserted into the membrane such that the regions 
encoded 5* of the transmembrane domain are extracellular and the sequences 3* become 
intracellular. Of course, if these transmembrane domains are placed 5* of the variable region, they 
will serve to anchor it as an intracellular domain, which may be desirable in some embodiments. 
ssTMs and TMs are known for a wide variety of membrane bound proteins, and these sequences 

2 0 may be used accordingly, either as pairs from a'particular protein or with each component being 

taken from a different protein, or alternatively, the sequences may be synthetic, and derived 
entirely from consensus as artificial delivery domains. 

As will be appreciated by those in the art, membrane-anchoring sequences, including both ssTM 
and TM, are known for a wide variety of proteins and any of these may be used. Particularly 
25 preferred membrane-anchoring sequences include, but are not limited to, those derived from CDS. 
1CAM-2. IL-8R, CD4 and LFA-1. 

Useful sequences include sequences from: 1) class I integral membrane proteins such as iL-2 
receptor beta-chain (residues 1-26 are the signal sequence, 241-265 are the transmembrane 
residues: see Hatakeyama et al.. Science 244:551 (1989) and yon Heijne et al, Eur. J. Biochem. 

3 0 174:671 (1988)) and insulin receptor beta chain (residues 1-27 are the signal, 957-959 are the 

transmembrane domain and 960-1382 are the cytoplasmic domain; see Hatakeyama, supra, and 
Ebina et al.. Cell 40:747 (1985)); 2) class II integral membrane proteins such as neutral 
endopeptidase (residues 29-51 are the transmembrane domain, 2-28 are the cytoplasmic domain; 
see Malfroy et al., Biochem. Biophys. Res. Commun. 144:59 (1987)); 3) type III proteins such as 



human cytochrome P450 NF25 (Hatakeyama. supra); and 4) type IV proteins such as human P- 
glycoprotein (Hatakeyama. supra). Particularly preferred are CDS and ICAM-2. For example, the 
signal sequences from CDS and ICAM-2 lie at the extreme 5* end of the transcript. These consist 
of the amino acids 1-32 in the case of CDS (MASPLTRFLSLNLLLLGESILGSGEAKPQAP; (SEQ 
5 ID NO:59) Nakauchi et al., PNAS USA S2:5126 (19S5) and 1-21 in the case of ICAM-2 

(MSSFGYRTLTVALFTLICCPG (SEQ ID NO:60); Staunton et a!.. Nature (London) 339:61 (1989)). 
These leader sequences deliver the fluorette to the membrane while the hydrophobic 
transmembrane domains, placed 3* of the fluorette region, serve to anchor the construct in the 
membrane. These transmembrane domains are encompassed by amino acids 145-195 from CDS 
10 (PQRPEDCRPRGSVKGTGLDFACDIYIWAPLAGICVALLLSLIITLICYHSR (SEQ ID NO:61); 

Nakauchi. supra) and 224-256 from ICAM-2 (MVIIVTWSVLLSLFVTSVLLCFIFGQHLRQQR (SEQ 
ID NO:62); Staunton, supra). 

Alternatively, membrane anchoring sequences include the GPI anchor, which results in a covalent 
bond between the molecule and the lipid bilayer via a glycosyl-phosphatidylinositol bond for 
15 example in DAF (PNKGSGTTSGTTRLLSGHTCFTLTGLLGTLVTMGLLT (SEQ ID NO:63). with 
the bolded serine the site of the anchor; see Homans et al.. Nature 333(6 170):269-72 (198S). and 
Moran et aL. J. Biol. Chem: 266:1250 (1991)). In order to do this, the GPI sequence from Thy-1 
can be cassetted 3* of the fluorette in place of a transmembrane sequence. 

Similarly, myristylation sequences can serve as membrane anchoring sequences. It is known that 
2 0 the myristylation of c-src recruits it to the plasma membrane. This is a simple and effective 
method of membrane localization, given that the first 14 amino acids of the protein are solely 
responsible for this function: MGSSKSKPKDPSQR (SEQ ID NO:64) (see Cross et a!., Mol. Cell. 
Biol. 4(9):1S34 (19S4); Spencer et al., Science 262:1019-1024 (1993), both of which are hereby 
incorporated by reference). This motif has already been shown to be effective in the localization of 

2 5 reporter genes and can be used to anchor the zeta chain of the TCR. This motif is placed 5* of the 

fluorette in order to localize it to the plasma membrane. Other modifications such as palmitoylation 
can be used to anchor fluorettes in the plasma membrane; for example, palmitoylation sequences 
from the G protein-coupled receptor kinase GRK6 sequence 

(LLQRLFSRQDCCGNCSDSEEELPTRL (SEQ ID NO:65). with the bold cysteines being 

3 0 palmitolyated; Stoffel et al.. J. Biol. Chem 269:27791 (1994)); from rhodopsin 

(KQFRNCMLTSLCCGKNPLGD (SEQ ID NO:66); Barnstable et al.. J. Mol. Neurosci. 5(3):207 
(1994)); and the p21 H-ras 1 protein (LNPPDESGPGCMSCKCVLS (SEQ ID NO:67); Capon et al.. 
Nature 302:33 (1983)). 

In a preferred embodiment, the targeting sequence is a lysozomal targeting sequence, including. 
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for example, a lysosomal degradation sequence such as Lamp-2 (KFERQ (SEQ ID NO:68); Dice, 
Ann. N.Y. Acad. Sci. 674:58 (1992); or lysosomal membrane sequences from Lamp-1 
( MLIPIAGFFALAGLVLIVLIAYL IGRKRSH AGyQT\ (SEQ ID NO:69), Uthayakumar et al.. Cell. Mol. 
Biol. Res. 41:405 (1995)) or Lamp-2 ILVPIAVGAALAGVULVLLAYF/G LKHHHAGYEQF (SEQ ID 
5 NO:70), Konecki et la., Biochem. Biophys. Res. Comm. 205:1-5 (1994), both of which show the 
transmembrane domains in italics and the cytoplasmic targeting signal underlined). 

Alternatively, the targeting sequence may be a mitrochondrial localization sequence, including 
mitochondrial matrix sequences (e.g. yeast alcohol dehydrogenase Ml; 
MLRTSSLFTRRVQPSLFSRNILRLQST (SEQ ID NO:71); Schatz. Eur. J. Biochem. 165:1-6 
10 (1987)); mitochondrial inner membrane sequences (yeast cytochrome c oxidase subunit IV; 
MLSLRQSIRFFKPATRTLCSSRYLL (SEQ ID NO:72); Schatz. supra); mitochondrial 
intermembrane space sequences (yeast cytochrome c1; 

MFSMLSKRWAQRTLSKSFYSTATGAASKSGKLTQKLVTAGVAAAGITASTLLYADSLTAEAMTA 
(SEQ ID NO:73); Schatz, supra) or mitochondrial outer membrane sequences (yeast 70 kD outer 
15 membrane protein; MKSFITRNKTAILATVAATGTAIGAYYYYNQLQQQQQRGKK (SEQ ID NO:74); 
Schatz, supra). 

The target sequences may also be endoplasmic reticulum sequences, including the sequences 
from calreticulin (KDEL; Pelham. Royal Society London Transactions B; 1-10 (1992)) or 
adenovirus E3/19K protein (LYLSRRSFIDEKKMP (SEQ ID NO:75); Jackson et al., EMBO J. 
20 9:3153(1990)). 

Furthernriore, targeting sequences also include peroxisome sequences (for example, the 
peroxisome matrix sequence from Luciferase; SKL; Keller et a!., PNAS USA 4:3264 (1987)); 
farnesylation sequences (for example, P21 H-ras 1; LNPPDESGPGCMSCKCVLS (SEQ ID 
NO:76), with the bold cysteine famesylated; Capon, supra)\ geranylgeranylation sequences (for 
25 example, protein rab-5A; LTEPTQPTRNQCCSN (SEQ ID NO:77), with the bold cysteines 

geranylgeranylated; Farnsworth. PNAS USA 91:11963 (1994)); or destruction sequences (cyclin 
B1; RTALGDIGN (SEQ ID NO:78); Klotzbucher et al., EMBO J. 1:3053 (1996)). 

In a preferred embodiment, the targeting sequence is a secretory signal sequence capable of 
^ effecting the secretion of the fluorette. There are a large number of known secretory signal 
3 0 sequences which are placed 5* to the fluorette sequence and are cleaved from the peptide region 
to effect secretion into the extracellular space. Secretory signal sequences and their 
transferability to unrelated proteins are well known, e.g., Silhavy. et al. (1985) Microbiol. Rev. 49. 
398-418. This is particularly useful to generate a fluorette that is expressed at the surface of a 
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cell. In a preferred approach, a fusion product is configured to contain, in series, secretion signal 
peptide-presentation structure-flourette-presentation structure. In this manner, target cells grown 
in the vicinity of cells caused to express the fluorettes, are bathed in secreted fluorette. Target 
cells exhibiting a phenotypic change in response to the presence of a fluorette, or internalization of 
5 the fluorette and binding to intracellular targets, are localized by any of a variety of methods, such 
as, FRET analysis (Selvin etal. Methods Enzymol. 246:300-334 (1995)) as described below. 

Suitable secretory sequences are known, including signals from IL-2 

(MYRMQLLSCIALSLALVTNS (SEQ ID NO:79); Villinger et al., J. Immunol. 155:3946 (1995)), 
growth hormone (MATGSRTSLLLAFGLLCLPWLQEGSAFPI (SEQ ID NO:80); Roskam et aL. 

10 Nucleic Acids Res. 7:30 (1979)); preproinsulin (MALWMRLLPLLALLALWGPDPAAAFVN (SEQ ID 
NO:81); Bell et al.. Nature 284:26 (1980)); and influenza HA protein (MKAKLLVLLYAFVAGDQl 
(SEQ ID NO:82); Sekiwawa et al.. PNAS 80:3563)), with cleavage between the non-underilned- 
underlined junction. A particularly preferred secretory signal sequence is the signal leader 
sequence from the secreted cytokine IL-4, which comprises the first 24 amino acids of IL-4 as 

15 follows: MGLTSQLLPPLFFLLACAGNFVHG (SEQ ID NO:83). 

In a preferred embodiment, the fusion partner is a rescue sequence. A rescue sequence is a 
sequence which may be used to purify or isolate the fluorette (or, in some cases, the nucleic acid 
encoding it). Thus, for example, peptide rescue sequences include purification sequences such as 
the HiSg tag for use with Ni affinity columns and epitope tags for detection, immunoprecipitation or 

2 0 FACS (fluorescence-activated cell sorting). Suitable epitope tags include myc (for use with the 

commercially available gE10 antibody), the BSP biotinylation target sequence of the bacterial 
enzyme BirA, flu tags, lacZ. and GST. 

In a preferred embodiment, a fluorette sequence functions as a rescue sequence. The fluorette 
because it binds a fluorophore dye can be used to purify or isolate a molecule to which it is fused 
25 or attached by affinity chromatography using fluorophore dye columns. If the fluorette alters the 
excitation or emission spectrum of the bound fluorophore dye, as described below, this difference 
can be used to monitor attachment and/or elution of the fluorette from the affinity column. 
Alternatively, the fluorette/dye complex can be used in FACS. If desired, a linker joining the 
fluorette to its fusion partner also contains a convenient site or sites to seperate the fluorette from 

3 0 the fusion partner, such as. a unique protease recognition sequences. Alternatively, the affinity 

column can contain antibody reactive with the fluorette. 



In a preferred embodiment, the fusion partner is a stability sequence to confer stability to the 
fluorette. Thus, for example, fluorettes may be stabilized by the incorporation of glycines after an 
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amino-terminal methionine (MG or MGGO). for protection of the fluorette to ubiquitination as per 
Varshavsky's N-End Rule, thus conferring long half-life in the cytoplasm. Similarly, two prolines at 
the C-terminus impart upon peptides resistance to carboxypeptidase action. The presence of two 
glycines prior to the prolines impart both flexibility and prevent structure initiating events in the di- 
5 proline to be propagated into the fluorette structure. Thus, preferred stability sequences are as 
follows: MG(X)„GGPP (SEQ ID NO:84), where (X)„ is a fluorette of any amino acid and n is an 
integer of at least about 8. as outlined above for fluorette length. 

In a preferred embodiment, the fusion partner is a dimerization sequence. A dimerization 
sequence allows the non-covalent association of one fluorette to another fluorette. with sufficient 
10 affinity to remain associated under normal physiological conditions. The dimers may be homo- or 
heterodimers. 

Dimerization sequences may be a single sequence that self-aggregates, or two sequences. That 
is. a first fluorette with dimerization sequence 1, and a second fluorette with dimerization sequence 
2, such that upon introduction into a cell and expression of the nucleic acids, dimerization 
15 sequence 1 associates with dimerization sequence 2 to form new random peptide structures. In 
addition, dimerization sequences can be used as presentation structures. That is, by putting a first 
dimerization sequence at one terminus of the fluorette. and a second dimerization sequence at the 
other terminus, similar to some other presentation structures, a "cyclized" fluorette can be made. 

Suitable dimerization sequences will encompass a wide variety of sequences. Any number of 
20 protein-protein interaction sites are known. In addition, dimerization sequences may also be 
elucidated using standard methods such as the yeast two hybrid system, and traditional 
biochemical affinity binding studies, or even using the present methods. 

The fusion partners may be placed anywhere (i.e. N-terminal. C-terminal. internal) in the structure 
as the biology and activity permits, although in general. N- or C-terminal fusions are preferred to 

2 5 fusions internal to the fluorette sequence. 

In another embodiment, a fluorette occupies an internal position within the fusion partner as the 
biology and activity permits. This will allow a means to measure access of the fluorette for its 
fluorophore dye as a means of determining structure and function of the region in which the 
fluorette resides. 

3 0 In a preferred embodiment, the fusion partner includes a linker or tethering sequence. Linker 
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sequences between various targeting sequences (for example, membrane targeting sequences) 
and the other components of the constructs and fiuorettes may be desirable to allow the candidate 
agents to interact with potential targets unhindered. For example, useful linkers include glycine- 
serine polymers (including, for example, (GS)n(SEQ ID NO:85), (GSGGS)^ (SEQ ID NO:86) and 
5 (GGGS)n (SEQ ID NO:87), where n is an integer of at least one), glycine-alanine polymers, 
alanine-serine polymers, and other flexible linkers such as the tether for the shaker potassium 
channel, and a large variety of other flexible linkers, as will be appreciated by those in the art. 
Glycine-serine polymers are preferred since both of these amino acids are relatively unstructured, 
and therefore may be able to serve as a neutral tether between components. Secondly, serine is 
10 hydrophilic and therefore able to solubilize what could be a globular glycine chain. Third, similar 
chains have been shown to be effective in joining subunits of recombinant proteins such as single 
chain antibodies. As is more fully outlined below, linkers such as these may also be used in some 
embodiments between fiuorettes and the moiety to which they are fused. 

In alternative embodiments linkers can also be sequences that are derived from other proteins and 
1 5 have no structure or an assumed structure. In another embodiment the linkers distance the 
fluorette from a molecule to which they are attached, for example, a fusion protein. 

In addition, the fusion partners, including presentation structures, may be modified, randomized, 
and/or matured to alter the presentation orientation of the fluorette. For example, determinants at 
the base of the loop may be modified to slightly modify the internal loop tertiary structure, which 
2 0 maintains the fluorette amino acid sequence. 

In a preferred embodiment, combinations of fusiori partnisrs are used. Thus, for example, any 
number of combinations of presentation structures, targeting sequences, rescue sequences, and 
stability sequences may be used, with or without linker sequences. 

In a preferred embodiment, the fluorette is fused to a target analyte. for example, to monitor or 

2 5 follow the target analyte, as is more fully described below. By "target analyte" or "analyte" or 

grammatical equivalents herein is meant any molecule, compound or particle to be detected. As 
will be appreciated by those in the art. a large number of analytes may be detected using the 
present methods; basically, any target analyte to which a fluorette may be attached may be 
detected using the methods of the invention. 

3 0 Suitable analytes include organic and inorganic molecules, including biomolecules. In a preferred 

embodiment, the analyte may be an environmental pollutant (Including pesticides, insecticides. 
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toxins, etc.); a chemical (including solvents, polymers, organic materials, etc.); therapeutic 
molecules (including therapeutic and abused drugs, antibiotics, etc.); biomolecules (including 
hormones, cytokines, proteins, lipids, carbohydrates, cellular membrane antigens and receptors 
(neural, hormonal, nutrient, and cell surface receptors) or their ligands, etc); whole cells (including 
5 procaryotic (such as pathogenic bacteria) and eukaryotic cells, including mammalian tumor cells); 
viruses (including retroviruses, herpesviruses, adenoviruses, lentiviruses. etc.); and spores; etc. 
Particularly preferred analytes are proteins. 

In a preferred embodiment, the target analyte is a protein and the fluorette is made as a fusion 
protein, using techniques well known in the art. For example, the nucleic acid encoding the 

10 fluorette is linked in-frame with a nucleic acid encoding a target protein to produce a fusion nucleic 
acid. Preferably, fusion proteins are produced by culturing a host cell transformed with an 
expression vector containing the fusion nucleic acid, under the appropriate conditions to induce or 
cause expression of the fusion protein. The conditions appropriate for fusion protein expression 
will vary with the choice of the expression vector and the host cell, and will be easily ascertained 

15 by one skilled in the art through routine experimentation. For example, the use of constitutive 
promoters in the expression vector will require optimizing the growth and proliferation of the host 
cell, while the use of an inducible promoter requires the appropriate growth conditions for 
induction. In addition, in some embodiments, the timing of expression is important. 

Alternatively, fluorettes can be produced by in vitro transcription of the encoding nucleic acid and 
2 0 translation of the RNA transcript, as known in the art. 

In a preferred embodiment, the target analyte is other than a protein, and the fusion of the fluorette 
to the target analyte is generally done chemically. In general, the fluorette and the target analyte 
are attached through the use of functional groups on each that can then be used for attachment. 
Preferred functional groups for attachment are amino groups, carboxy groups, oxo groups and 

2 5 thiol groups. These functional groups can then be attached, either directly or indirectly through the 
use of a linker. Linkers are well known in the art; for example, homo-or hetero-bifunctional linkers 
as are well known (see 1994 Pierce Chemical Company catalog, technical section on 
cross-linkers, pages 155-200, incorporated herein by reference). In an additional embodiment, 
carboxyl groups (either from the surface or from the candidate agent) may be derivatized using 

30 well known linkers (see the Pierce catalog). For example, carbodiimides activate carboxyl groups 
for attack by good nucleophiles such as amines (see Torchilin et al., Critical Rev, Therapeutic 
Drug Carrier Systems, 7(4):275-308 (1991), expressly incorporated herein). Proteinaceous 
species may also be attached using other techniques known in the art. for example for the 
attachment of antibodies to polymers; see Slinkin et al., Bioconj. Chem. 2:342-348 (1991); 
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Torchilin et a!., supra; Trubetskoy et al.. Bioconj. Chem. 3:323-327 (1992); King et al., Cancer 
Res. 54:6176-6185 (1994); and Wilbur et al.. Bioconjugate Chem. 5:220-235 (1994), all of which 
are hereby expressly incorporated by reference). It should be understood that the fluorettes and 
target analytes may be attached in a variety of ways, including those listed above. What is 
5 important is that manner of attachment does not significantly alter the functionality of the fluorette 
(and preferably the target analyte as well); that is. the attachment should be done in such a flexible 
manner as to allow the interaction of the fluorette peptide with a dye, and the target analyte to its 
binding partners. 

In a preferred embodiment the fluorette will bind to Texas Red. Suitable fluorettes in this 
10 embodiment include, but are not limited to, KHVQYWTQMFYS (SEQ ID NO:1); DFLQWKLARQKP 
' (SEQ ID NO:2); KPVQYWTQMFYT (SEQ ID NO:15); KPAQYWTQMFYS (SEQ ID NO:16); 

KNVQYWTQMFYT (SEQ ID NO:17); KHVQYWTHMFYT (SEQ ID NO:18); KHVQYWTQMFYT 

(SEQ ID NO:19); NHVHYWTQMFYS (SEQ ID NO:20); THVQYWTQMFYS (SEQ ID NO:21); 

RTIWEPKEASNHT (SEQ ID NO:105): WSKMGHTVT (SEQ ID NO:106); RWTWEPISE (SEQ ID 
15 NO:107); GNQKCLQHNRCST (SEQ ID NO:108); SQTWSFPEH (SEQ ID NO:109); 

EPMARPWERKQDR (SEQ ID NO:110); and GTLSATRPYGRQW (SEQ ID NO:111). 

In a preferred embodiment the fluorette will bind to Rhodamine. Suitable fluorettes In this 
embodiment include, but are not limited to. IPHPPMYWTRVF (SEQ ID NO:3); IPHRPMYWTPVF 
(SEQ ID NO:22); and LPHPPMYV\n"RVF (SEQ ID NO:23). 

2 0 In a preferred embodiment the fluorette will bind to Oregon Green 514. Suitable fluorettes in this 
embodiment include, but are not limited to, HGWDYYWDWTAW (SEQ ID NO:4); 
ASDYWDWEWYYS (SEQ ID NO:5); YPNDFEWWEYYF (SEQ ID NO:6); HTSHiSWPPWYF (SEQ 
ID NO:7); LEPRWGFGWWLK (SEQ ID NO:8); QYYGWYYDHNFW(SEQ ID NO:9); 
YMYDEYQYWNFW(SEQ ID NO:10); HEWEYYWDWTAW (SEQ ID NO:24); HEWDYYWDWTAW 

2 5 (SEQ ID NO:25); HGWDYYWDWTDW (SEQ ID NO:26); HGWDYYWDV\n-pW (SEQ ID NO:27); 

HGWDYYWDWTTW (SEQ ID NO:28); HGWDYNWDWTAW (SEQ ID NO:29); and 
QGWDYYWDWTAW (SEQ ID NO:30). 

In a preferred embodiment the fluorette will bind to Fluorescein. Suitable fluorettes in this 
embodiment include, but are not limited to, YPNDFEWWEYYF (SEQ ID NO:6); 

3 0 ASDYWDWEWYYS (SEQ ID NO:5); WYDDWNDWHAWP (SEQ ID NO:1 1); WHMSPSWGWGYW 

(SEQ ID NO:12); HMSWWEFYLVPP (SEQ ID NO:13); YWDYSWHYYAPY (SEQ ID NO:14); 
YPNEFDWWDYYY (SEQ ID NO:31); YPNDFEWWDYYY (SEQ ID NO:32); YHNDYEWWEYYY 
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(SEQ ID NO:33); YPNDFEWWEYYY (SEQ ID NO:34); YPNDFDWWEYYL (SEQ ID NO:35); 
YTHDYEWWEYYF (SEQ ID NO:36); YPNDYE\AAA/EYYF (SEQ ID NO:37); YPSDFEWWEYYF 
(SEQ ID NO:38); YHDFE\AAA/EYYF (SEQ ID NO:39); and YPYDFEWWEYYM (SEQ ID NO:40). 

Included within the definition of fluorettes are derivative or variant fluorettes. Accordingly, as used 
5 herein, a peptide is a fluorette if the overall homology of the peptide sequence to the amino acid 
sequences shown above (SED ID NOS 1-40 and 104-1 10) is preferably greater than about 70%, 
more preferably greater than about 75% even more preferably greater than about 80% and most 
preferably greater than 90%, with homologies of greater than 95 to 98% being especially 
preferred. Homology in this context means sequence similarity or identity, with identity being 
1 0 preferred. 

This homology will be determined using standard techniques known in the art. including, but not 
limited to, the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981). by 
the homology alignment algorithm of Needleman & Wunsch, J. MoL Biool. 48:443 (1970). by the 
search for similarity method of Pearson & Lipman, PNAS USA 85:2444 (1988). by computerized 

15 implementations of these algorithms (GAP. BESTFIT. FASTA. and TFASTA in the Wisconsin 
Genetics Software Package, Genetics Computer Group. 575 Science Drive, Madison, Wl), the 
Best Fit sequence program described by Devereux etaL, Nucl. Acid Res. ^2:387-395 (1984). 
preferably using the default settings, or by inspection. Preferably, percent identity is calculated by 
FastDB based upon the following parameters: mismatch penalty of 1 ; gap penalty of 1 ; gap size 

2 0 penalty of 0.33; and joining penalty of 30. "Current Methods in Sequence Comparison and 
Analysis." Macromolecule Sequencing and Synthesis. Selected Methods and Applications, pp 
127-149 (1988). Alan R. Liss, Inc. 

One example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence alignment 
from a group of related sequences using progressive, pairwise alignments. It can also plot a tree 

2 5 showing the clustering relationships used to create the alignment. PILEUP uses a simplification of 

the progressive alignment method of Feng & Doolittle. J. Mol. Evol. 35:351-360 (1987); the 
method is similar to that described by Higgins & Sharp CABIOS 5:151-153 (1989). Useful PILEUP 
parameters including a default gap weight of 3.00. a default gap length weight of 0.10, and 
weighted end gaps. 

3 0 Another example of a useful algorithm is the BLAST algorithm, described in Altschul et al.. J, Mol. 

Biol. 215, 403-410. (1990) and Karlin et al.. PNAS USA 90:5873-5787 (1993). A particularly useful 
BLAST program is the WU-BLAST-2 program which was obtained from Altschul et al.. Methods in 
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EnzvmoloQv . 266: 460-480 (1996); http://blast.wustl/edu/blast/ README.html]. WU-BLAST-2 
uses several search parameters, most of which are set to the default values. The adjustable 
parameters are set with the following values: overlap span =1, overlap fraction = 0.125, word 
threshold (T) = 11, The HSP S and HSP S2 parameters are dynamic values and are established 
5 by the program itself depending upon the composition of the particular sequence and composition 
of the particular database against which the sequence of interest is being searched; however, the 
values may be adjusted to increase sensitivity. A % amino acid sequence identity value is 
determined by the number of matching identical residues divided by the total number of residues 
of the "shorter" sequence in the aligned region. The "longer" sequence Is the one having the most 
10 actual residues in the aligned region (gaps introduced by WU-BIast-2 to maximize the alignment 
score are Ignored). 

An additional useful algorithm is gapped BLAST as reported by Altschul etaL Nucleic Acids Res. 
25:3389-3402. Gapped BLAST uses BLOSUM-62 substitution scores; threshold 7 parameter set 
to 9; the two-hit method to trigger ungapped extensions; charges gap lengths of /c a cost of 10+/c; 
15 set to 16. and Xg set to 40 for database search stage and to 67 for the output stage of the 
algorithms. Gapped alignments are triggered by a score corresponding to -22 bits. 

A percent amino acid sequence identity value is determined by the number of matching identical 
residues divided by the total number of residues of the "longer" sequence in the aligned region. 
The "longer" sequence is the one having the most actual residues in the aligned region (gaps 
2 0 introduced by WU-Blast-2 to maximize the alignment score are ignored). 

Alternatively, percent sequence identity is determined by inspection in which only identities are 
scored positively (+1) and all forms of sequence variation including gaps are assigned a value of 
"0". Percent sequence identity can be calculated, for example, by dividing the number of matching 
Identical residues by the total number of residues of the "shorter" sequence in the aligned region 
2 5 and multiplying by 100. The "longer sequence is the one having the most actual residues in the 
aligned region. 

Once identified, the amino acid sequences of the fluorettes may be modified as needed. 
Sequence modifications include substitutions, deletions, insertions or any combination thereof may 
be used to arrive at a final derivative. Generally these changes are done on a few amino acids to 
30 minimize the alteration of the molecule. However, larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of the fluorette are desired, 
substitutions are generally made in accordance with the following chart: 
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Original Residue 



Chart I 

Exemplary Substitutions 



Ala 
Arg 
5 Asn 



Ser 
Lys 



Gin. His 



10 Gly 



Asp 
Cys 
Gin 
Glu 



Glu 
Ser 
Asn 
Asp 
Pro 



His 
lie 
Leu 
Lys 



Asn, Gin 
Leu. Val 
lie. Val 
Arg, Gin, Glu 
Leu, Me 
Met, Leu, Tyr 



15 Met 



Phe 
Ser 
Thr 
Trp 



Thr 
Ser 
Tyr 



2 0 Tyr 
Val 



Trp, Phe 
lie. Leu 



Substantial changes in function are made by selecting substitutions that are less conservative 
than those shown in Chart I, although these generally are not preferred. For example, 
substitutions may be made which more significantly affect the structure of the fluorette backbone 

25 in the area of the alteration, for example the alpha-helical or beta-sheet structure; the charge or 
hydrophobicity of the molecule; or the bulk of the side chain. The substitutions which in general 
are expected to produce the greatest changes in the fiuorette's properties are those in which (a) a 
hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g. 
leucyl, isoleucyl. phenylalanyl, valy! or alanyl; (b) a cysteine or proline is substituted for (or by) any 

3 0 other residue; (c) a residue having an electropositive side chain, e.g. lysyl, arginyl. or histidyl, is 
substituted for (or by) an electronegative residue, e.g. glutamyl or aspartyl; or (d) a residue having 
a bulky side chain, e.g. phenylalanine, is substituted for (or by) one not having a side chain, e.g. 
glycine. 

In a preferred embodiment, not more than about three substitutions or deletions will be made, and 
35 that the change will not be more than about 20 number %, usually not more than about 10 number 
%. of the number of amino acids in the fluorette, although in some instances higher numbers of 
alterations may be made. 

Similarly, if the function or affinity of the fluorette is to be decreased, the amount of changes may 
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also be greater. Preferred are conservative substitutions, as known in the art, including 
substitutions within the large hydrophobic group: isoieucine, leucine, valine and phenylalanine; 
between serine and threonine; glycine and alanine; asparagine and glutamine; aspartic acid and 
glutamic acid; or lysine, arginine and histidine. In some embodiments, non-conservative 
5 alterations are done. 

As will be appreciated by those skilled in the art, a number of methods exist for the creation, 
isolation, or selection of fluorettes from a library of sequences (Oldenburg et al. Proc. Natl. Acad. 
Sci. U.S.A. 89:5393-5397 (1992); Scott etaL, Proc. Natl. Acad. Sci. U.S.A. 89:5398-5402 (1992); 
Cull etaL, Proc. Natl. Acad. Sci. U.S.A. 89:1865-1869 (1992); Devlin etaL Science 249:404-406 

10 (1990); Schatz. Biotechnology 11:1138-1143 (1993)). In preferred embodiments, such libraries 
are based on, for example, either bacteriophage (Matthews and Wells. Science 260:1 1 13-1 117 
(1993); Kolvunen etaL Biotechnology 13:265-270 (1995); Yu and Smith. Methods Enzymol. 267:3- 
27 (1996); Harrison etaL Methods Enzymol. 267:83-109 (1996); Rebar and Pabo. Science 
263:671-673 (1994); Rebar etaL, Methods Enzymol, 267:129-149 (1996); Sagglo and Laufer. 

15 Biochem. J. 293:613-616 (1993); Katz. Biochemistry 34:15421-15429 (1995); Caparon etaL Mol. 
Divers. 1:241-246 (1996); Parmley and Smith. Gene 73:305-318 (1988)). for example M13 or 
plasmids (Cull etaL Proc. Natl. Acad. Sci. U.S.A. 89:1865-1869 (1992); Schatz etaL Methods 
Enzymol. 267:171-191 (1996); ) or polysomes (Mattheakis etaL Proc. Natl. Acad. Sci. U.S.A. 
91:9022-9026; Hanes and Pluckthun Proc. Natl. Acad. Sci. U.S.A. 94: 4937-4942; Roberts and 

20 Szostak. Proc. Natl. Acad. Sci. U.S.A. 94:12297-12301)) Alternatively, peptide libraries can be 
expressed on the surface of a eukaryotic or prokaryotic cell and directly bound to a fluorophore 
dye and sorted, for example, by FACS. Alternatively, the cell expressing a fluorette is bound to a 
fluorophore dye that is bound to bead or a magnetic bead and purified and amplified as known in 
the art. 

25 In a preferred embodiment, fluorettes are created, isolated, or selected from aptamer library phage 
display approaches. Such libraries are based on, for example. Ml 3 filamentous bacteriophage 
(Smith.Science. 228:1315-1317 (1985); Cwiria etaL Proc. Natl. Acad. Sci. U.S.A. 87:6378-6382 
(1990); Devlin etaL Science 249:404-406 (1990); Scott and Smith. Science 249:386-390 (1990)) 
In a preferred embodiment, fluorettes are identified by screening a phage display peptide library 

3 0 that contains a combinatorial library of peptides interposed betwen the leader sequence and the 
amino terminus of mature form of bacteriophage M13amp9 minor coat protein, pill. 

The recombinatorial library can be produced by a number of methods as known in the art. In a 
preferred embodiment, the library is produced by annealing and cloning an oligonucleotide library 
of sequences into the M13amp9 pill gene. Annealing of the oligonucleotide library to produce 



double-stranded DNA preferably produces "sticky ends" to facilitate directional cloning into the 
appropriate reading frame of the M13amp9. Alternatively, the annealed oligonucleotide library 
contains unique restriction enzyme sites, which are digested to produce "sticky ends" to facilitate 
directional cloning. The oligonucleotide library preferably also encodes an In-frame linker. 
5 tethering sequence, or spacer as described herein. The function of the linker is to decrease or 
minimize sterlc hindrance and promote accessibility of the fluorette for its fluorophore dye. The 
linker also promotes structural and functional independence of the fluorette. Phage particles 
bearing the peptide library are produced by introducing the M13amp9 vector into a permissive 
bacterial host, such as, Escherichia co// and other gram negative bacteria. According to the above 
10 strategy for cloning the peptide library, during phage maturation, the leader secretory sequence of 
pill is removed, resulting in the peptide library sequence positioned immediately at the amino- 
terminus of the mature protein. 

Cloning and expression of the oligonucleotide library produces a library of peptides either directly 
fused to the pill protein or, preferably, fused via a linker or spacer. The linker or spacer can be, 
15 for example, sequence, as described herein, interposed between the fluorette and the pill protein 
sequences. 

The nucleic acids which give rise to the fluorettes are chemically synthesized, and thus may 
incorporate any nucleotide at any position. Thus, when the nucleic acids are expressed to form 
peptides, any amino acid residue may be incorporated at any position. The synthetic process can 
20 be designed to generate randomized nucleic acids, to allow the formation of all or most of the 
possible combinations over the length of the nucleic acid, thus forming a library of randomized 
nucleic acids and peptides. In an alternative embodiment, random DNA fragments may be made 
from fragmentation of genomic DNA. followed by sizing, and ligation into an appropriate vector for 
expression and selection. 

25 The library should provide a sufficiently structurally diverse population of randomized expression 
products to effect a probabilistically sufficient range to allow isolation of a fluorette. Accordingly, a 
library must be large enough so that at least one of its members will have a structure that gives it 
affinity for a fluorophore dye. Although it is difficult to gauge the required absolute size of a library, 
nature provides a hint with the immune response: a diversity of 10^-10® different antibodies 

3 0 provides at least one combination with sufficient affinity to interact with most potential antigens 
faced by an organism. Published in vitro selection techniques have also shown that a library size 
of 10^ to 10® is sufficient to find structures with affinity for the target. A library of all combinations 
of a peptide from about 7 to 20 amino acids in length has the potential to code for 20^ (10^) to 20^°. 
Thus, with libraries of 10^ to 10® per ml of phage particles the present methods allow a "working" 



subset of a theoretically complete interaction library for 7 amino acids, and a subset of shapes for 
the 20^° library. Thus, in a preferred embodiment, at least 10®. preferably at least 10®. more 
preferably at least 10^° and most preferably at least 10^^ different expression products are 
simultaneously analyzed in the subject methods. Preferred methods maximize library size and 
5 diversity. 

It is Important to understand that in any library system encoded by oligonucleotide synthesis one 
cannot have complete control over the codons that will eventually be incorporated into the peptide 
structure. This is especially true in the case of codons encoding stop signals (TAA, TGA. TAG). 
In a synthesis with NNN as the random region, there is a 3/64. or 4.69%, chance that the codon 

10 will be a stop codon. Thus, in a peptide of 10 residues, there is an unacceptable high likelihood 
that 46.7% of the peptides will prematurely terminate. For free peptide structures this is perhaps 
not a problem. But for larger structures, such as those envisioned here, such termination will lead 
to sterile peptide expression. To alleviate this, random residues are encoded as NNK, where K= T 
or G. This allows for encoding of all potential amino acids (changing their relative representation 

15 slightly), but importantly preventing the encoding of two stop residues TAA and TGA. Thus, 
libraries encoding a 10 amino acid peptide will have a 15.6% chance to terminate prematurely. 
For candidate nucleic acids which are not designed to result in peptide expression products, this is 
not necessary. In one embodiment, the library is fully randomized, with no sequence preferences 
or constants at any position. 

2 0 Once generated a peptide library is screened for binding to a fluorescent dye. In a preferred 
embodiment, bacteriophage expressing fluorettes are selected by biopanning. which consists of 
four sequential steps: i) phage binding with a fluorophore dye carrier, ii) removal of unbound or 
weakly bound phage, iii) elution of bound phage and (iv) amplification of bound phage. Certain 
steps can be omitted, for example, such as washing and amplification if the peptide library is 

25 expressed in eukaryotic or mammalian cells and fluorescence is detected and cells are shorted 
according to fluorescence or spectral shifts. If necessary or desirable, the amplified phage are 
used for the subsequent round(s) of biopanning against the corresponding fluorophore dye carrier 
until apparent enrichment for binding is observed over background. Preferably, after about four 
rounds of biopanning, phage bearing sequences that bind specifically to a fluorophore dye can be 

30 identified. Following biopanning, the individual phage are isolated, amplified and the sequence of 
the fluorette can be determined by various methods as known in the art. In another embodiment, 
eukaryotic or prokaryotic cells expressing a peptide library on their surface can be sorted by FACS 
and/or magnetic beads to isolate cells expressing fluorettes. 

For biopanning, the fluorophore dyes can be bound to solid-phase carriers. In a preferred 
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embodiment, the dyes are covalently attached to a target bead. This facilitates washing and 
removal of non-specifically bound or weakly bound phage. Preferably, a linker or spacer molecule 
is interposed between the fluorophore dye and the target bead, for example, succinimidy esters 
and derivatives of fluorophores. A spacer molecule increases the accessibility of the fluorophore 
5 dye and minimizes potential steric hindrance that may interfere with the interactions between the 
fluorophore dye and the bacteriophage particles that bear the peptide library. The number of 
fluorophore dye molecules bound to the carrier will vary. In general, there are up to about 1 
micromole. Higher or lower amounts of dye can be attached per ml of carrier beads, as needed, 
for example to increase or decrease the stringency of the biopanning conditions. 

10 Once a fluorette is identified, it can be mutagenized or derivatized to isolate fluorettes with altered 
properties. In a preferred embodiment, the derivatize fluorettes have a higher affinity or specificity, 
thereby, providing, for example, increased sensitivity for the assay or system in which the fluorette 
is employed. This also allows the use of lower amounts of fluorescent dye, thereby, limiting 
toxicity of the fluorescent dye to cells, if needed. 

In a preferred embodiment, affinity maturation (Yu and Smith, supra) can be used to produce 
mutant or derivative fluorettes with improved affinities over their corresponding parental fluorettes. 
To create, select or isolate fluorettes with a higher affinity, a second oligonucleotide library is 
produced that is. preferabbly, based. That Is, some positions within the sequence are either held 
constant, or are selected from a limited number of possibilities. For example, in a preferred 
embodiment, the nucleotides or amino acid residues are randomized within a defined class, for 
example, of hydrophobic amino acids, hydrophilic residues, stericatly biased (either small or large) 
residues, towards the creation of cysteines, for cross-linking, prolines for SH-3 domains, serines, 
threonines, tyrosines or histidines for phosphorylation sites, etc., or to purines, etc. 

In a preferred embodiment, the amino acid sequences of the fluorettes are subjected to biased 
mutagenesis at a desired mutagenesis rate using a number of methods as known in the art. The 
sequences can be mutagenized by systemmatic alteration of the codons encoding the fluorettes. 
Oligonucleotide mutagenesis, is preferablly. performed during the synthesis of the oligonucleotide 
library. To increase the probability that all amino acids will be represented at each position and to 
limit the generation of stop codons within the fluorette encoding sequences, specific nucleotides, 
such as A and C, can be omitted from the third position of each codon. As described above, the 
oligonucleotide library is preferably annealed and cloned into an expression vector of choice. 

To select fluorettes with higher binding affinity than the parental fluorette, the biopanning 
conditions are preferably altered to be more stringent by. for example, changing one or more of 
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the following conditions. This also provides a method to maximize selection against the parental 
fluorette. To increase the stringency of the biopanning, the concentration of the fiuorophore dye 
bound to the carrier beads can be decreased. In addition, the phage concentration represented in 
the binding step and/or the binding time also can be reduced. Alternatively, the stringency of the 
5 washing conditions is increased. Washing stringency can be increased by. for example, 

increasing the volume of washing buffer per ml of carrier beads, increasing the wash temperature, 
altering the ionic strength or pH of the wash buffer, increasing the detergent concentration or by 
using a stronger, ionizing detergent. (Sambrook et ai, (1989) Molecular Cloning: A Laboratory 
Manual (2nd Edition). Cold Spring Harabor Laboratory, Cold Spring Harbor. NY). Other 
10 parameters that can be altered to increase stringency are. for example, increasing the 
temperature and competition for binding to the fiuorophore dye by related molecules. 

Fluorette binding to a fiuorophore dye can be assessed In several ways. For example, when a 
phage display system is used, binding of the bacteriophage-bearing peptide library to the 
fiuorophore dye-carrier can be determined by comparing the infectious particle titers of input 

15 phage to eluted phage. To measure phage-fluorophore dye binding in solution, a known 

concentration of free fiuorophore dye is mixed with a known number of purified phage particles. 
The number of particles is preferably calculated from the optical densities of the purifed phage at 
260 nm and 280 nm but other methods, such as. infectious titer can be used. Phage-fluorophore 
dye complexes are separated by. for example, precipitation to remove unbound dye. spotted onto 

20 a solid-support filter, such as. nitrocellulose and scanned for fluorescence. Other methods to 
remove unbound dye include, for example, competition with related molecules, washing, and 
Increased stringency To determine nonspecific binding, controls using phage particles that do not 
display recombinatorial library sequences are run in parallel. In addition, specific to nonspecific 
binding ratios also can be quantitated. 

25 Fluorette/fluorophore dye dissociation constants can be determined in several ways. For 

example, when a phage display system is used, phage-fluorophore dye dissociation constants can 
be determined by. for example, incubating for several hours subsaturating amounts of phage with 
a fiuorophore dye bound to a carrier, for example, a bead as described above. The bead 
suspensions are removed, for example, by centrifugatlon, and the unbound phage in the 

3 0 supernatants can be titrated. Other methods for measuring unbound phage are known In the art. 
for example, visualization. Dissociation constants can be measured via a standard linear 
Scatchard plot. Nonspecific background binding is determined using phage that does not express 
fluorette sequences. 

In a preferred embodiment, fluorettes are encoded by nucleic acids and produced by recombinant 
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molecular biology techniques (Sambrook etal. (1989) Molecular Cloning: A Laboratory Manual 
(2nd Edition) Cold Spring Harbor Laboratory, Cold Spring Harbor, NY; Ausubel etaL (1994) 
Current Protocols in Molecular Biology. Massachusettes General Hospital and Harvard Medical 
School. Boston. MA). Thus, the present invention provides nucleic acids encoding fluorettes. By 
5 "nucleic acids" herein is meant a nucleic acid. DNA or RNA depending on the delivery or 
expression vehicle or vector used, which can be manipulated to express fluorettes; that is, the 
nucleic acids encoding the fluorettes and the fusion partners and linkers if present. In addition, the 
nucleic acids will also generally contain enough extra sequence to effect translation or 
transcription, as necessary. For a fluorette. the nucleic acid generally contains cloning sites which 

1 0 are placed to allow In frame expression of the fluorettes. and any fusion partners, if present, such 
as presentation structures. For example, when presentation structures are used, the presentation 
structure will generally contain the ihitating ATG, as a part of the parent vector. For example, for 
retrovirus expression, the nucleic acids are generally constructed with an internal CMV promoter, 
tRNA promoter or cell specific promoter designed for immediate and appropriate expression of the 

15 fluorette at the initiation site of RNA synthesis. The RNA is expressed anti-sense to the direction 
of retroviral synthesis and is terminated as known, for example with an orientation specific 
terminator sequence. Interference from upstream transcription is alleviated in the target cell with 
the self-inactivation deletion, a common feature of certain retroviral expression systems. In 
another embodiment, other expression vectors or systems also can be used, such as, adenovirus, 

2 0 adenoassociated virus, or transposons. 



Generally, the nucleic acids are introduced and expressed within the cells to produce fluorettes. 
By "introduced into " or grammatical equivalents herein is meant that the nucleic acids enter the 
cells in a manner suitable for subsequent expression of the nucleic acid. The method of 
introduction is largely dictated by the targeted cell type, discussed below. Exemplary methods 

2 5 include CaP04 precipitation, liposome fusion, lipofectin®, electroporation, viral infection, etc. The 

candidate nucleic acids may stably integrate into the genome of the host cell (for example, with 
retroviral introduction), or may exist either transiently or stably in the cytoplasm (i.e. through the 
use of traditional plasmids, utilizing standard regulatory sequences, selection markers, etc.). As 
many pharmaceutically important screens require human or model mammalian cell targets, 

3 0 retroviral vectors capable of transfecting such targets are preferred. 



In a preferred embodiment the fluorettes functions within the cell in which they are expressed. 
Alternatively, fluorettes may be expressed within a cell or secreted from a cell, and purified. If 
desired, the fluorette can be introduced into another cell by electroporation. calcium phosphate 
precipitation, microinjectioin. liposome fusion, lipofectin, viral infection, etc. Fluorettes also may be 
3 5 produced in vitro by transcription/translation reactions. Alternatively, fluorettes are chemically 
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synthesized using any of a number of methods. 



In a preferred embodiment, the present invention provides a complex or composition of a fluorette 
bound to a fluorophore dye. The binding of the fluorette to the fluorophore dye leads to four 
possible outcomes in regards to the excitation and emission spectra of the fluorophore dye i) little 
5 to no change in the fluorophore emission or excitation spectra, ii) change in fluorophore emission, 
iii) change in fluorophore excitation and (iv) change in both fluorophore emission and excitations. 

In one embodiment, binding of the fluorette to the fluorophore dye may prevent fluorophore 
excitation or emission. In either scenario, the fluorophore dye, no longer fluoresces when bound 
to the fluorette. In another embodiment, the excitation spectrum and/or the emission spectrum of 
10 the fluorophore dye Is shifted, to a shorter or longer wavelength. Alternatively, the excitation 
spectrum and/or emission spectrum can be broadened. 

In a preferred embodiment the excitation spectrum is altered up to about 5 nm, about 10 nm. being 
preferred, 20 nm particularly preferred, and about 100 nm being especially preferred. 

Fluorettes in their various forms find use in detector systems and various types of assays and 
15 techniques using a large set of potential fluorophore dyes, for example as. in detection of viral 
delivery systems, diagnostics, high-throughput assays. 

Once made, the compositions find use in a number of applications. These applications include 
binding a fluorophore dye to the fluorette, removing unbound dye and detecting fluorescence. In a 
preferred embodiment, the emission and/or excitation spectrum of the fluorophore dye is altered 
20 when bound to a fluorette. Therefore, it is not necessary to remove unbound dye. In a preferred 
embodiment, are applications that in some manner tethers the excitation and emission of two or 
more flourette/fluorescent dye complexes in such a way that they can be detected by fluorescence 
resonance energy transfer (FRET). 

FRET applications based on fluorette/fluorophore dye interactions provide methods to 
2 5 demonstrate the proximity or interactions of two or more molecules. These molecules include, for 
example, biological molecules, such as proteins, lipids, nucleic acids, carbohydrates. In another 
embodiment. FRET can be applied to detect or induce the localized activation of a produrg 

In a preferred embodiment, the interactions of two or more proteins, each containing a unique 
fluorette sequence can be demonstrated as follows. First, the fluorophore dyes are bound by their 
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corresponding fluorettes. Next, one of the fluorette/fluorophore dye complexes or donor complex 
is excited and fluoresces. When the two proteins containing the fluorette sequences are in close 
proximity or in some way have interacted, the emitted light of the first donor complex causes the 
second fluorette/fluorophore dye complex (acceptor) to fluoresce. 

5 Another example of a FRET application employs fluorette/fluorophore dye complexes that are 
covalently or associately linked to target biomaterials via small ligands. The demonstration of 
FRET as described above indicates an interaction or the proximity of the target biomaterials when 
bound to their respective ligands. Other applications include, "trigger dependent FRET", in which 
the use and design of trigger molecules, such as, heavy metal ions, drugs, etc., that specifically 
10 bind to the ligand or target biomaterials, causing dissociation of one or more of the fluorette/dye 
complexes from the target biomaterials and a decrease or loss of FRET. 

FRET may also be employed to detect intramolecular Interactions or events, such as, changes in 
molecular structure that occurs in a molecule with the binding and/or dissociation of a ligand. 

In FRET applications, double or higher order fluorettes may be linked, for example, via a rigid 
15 connector, such as polyalanine or polyserine, to provide the correct angle of orientation of 
corresponding donor and acceptor fluorette/fluorophore dye complexes. This may be of 
importance for efficient and quantitative FRET output. In some cases, it may be desirable for the 
spacer to contain a peptide sequence or other material which, upon binding a specific fluorophore 
dye, brings the donor and acceptor comlexes into proximity such that FRET can ensue. 

2 0 FRET applications also include dual fluorophore dye quenching due to fluorette binding. For 
example, fluorettes specific for donor and acceptor fluorophores are joined by a protease 
cleavable spacer. This fluorette dimer is bound to its two fluorophore dyes targets and FRET 
ensues as described above. However, FRET decreases after protease cleavage of the fluorette 
dimer. 

2 5 Other novel assays are based on fluorophore dye quenching due to its binding of a specific 

fluorette. For example, a target protein is labeled in vitro by a fluorophore dye and is assembled 
into a complex particle or structure with other materials (e.g., other proteins, DNA or RNA) whose 
conformation or structure is to be investigated. The addition of a specific fluorette after the 
complex Is assembled and measuring fluoresence indicates if the fluorette had access (binding) or 

3 0 no access (no binding) to its target dye. 
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Novel reporter gene assays based on fluorette-induced Stokes shift changes also can be created. 
For example, a fluorette fused with a given carrier protein can be used as a gene activity reporter 
for the presence of the protein. Cell extracts, or living cells expressing the protein, can be 
monitored for fluorette activity when the specific fluorophore dye is bound by the peptide. Binding 
5 of the fluorophore dye by the fluorette/peptide modifies the excitation and/or emitted wavelength 
permitting specific detection of the expressed protein. Consequently, the more Stokes shift 
changes introduced by binding of peptide the more sensitive the assay can become. The 
simplicity of such an approach surpasses the currently available enzymatic reporter gene assays 
(e.g.. luciferase and beta-galactosidase assays). 

10 The following examples serve to more fully describe the manner of using the above-described 
invention, as well as to set forth the best modes contemplated for carrying out various aspects of 
the invention. It is understood that these examples in no way serve to limit the true scope of this 
invention, but rather are presented for illustrative purposes. All references, patents, patent 
applications, and publications cited herein are incorporated by reference in their entirety. 

15 EXAMPLE I 

Synthesis of Fluorophore Dye Carriers 
Four fluorophore dyes, Texas Red, Fluorescein, Rhodamine Red and Oregon Green 514, were 
chosen, for selection of fluorettes from a phage display library (Example 2). The chemical 
structures of the dye conjugates are shown in Figure 1 . The first three dyes, Texas Red. 

20 Fluorescein and Rhodamine Red fall into one group: (I) they have a relatively high molar extinction 
coefficient for absorption and excellent quantum fluorescence yields (Haugland supra), (ii) they 
have non-overlapping spectral characteristics, (iii) they have the potential for cross-quenching 
and/or FREt analysis. The fourth dye. Oregon Green 514, related structurally to Fluorescein, was 
chosen to detemnine if structure-function relationships could be discerned between fluorette 

2 5 peptides that bound related dyes. 

The extinction coefficient and quantum yield is important for sensitivity of detection and was a 
major factor determining choice of fluorophores. Fluorescence detection by the outlined approach, 
at its simplest conception, is stochiometric in nature. In addition, peptide binding of the target dye 
might lead to some fluorescence quenching. Finally, if fluorettes can be created that each can 

3 0 specifically bind different dyes then it is possible to measure proximal interactions between 

molecules by FRET relative to the binding of specific dyes having accommodating overlapping 
fluorescence spectra. 
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Succinimidyl esters of the four chosen fluorophore dyes (with 3- or 7-atom spacer between the 
dye molecule and the reactive succinimidyl group) were separately and covalently linked to target 
beads. Linkages were established through the formation of a stable peptide bond to UltraLink 
Immobilized DADPA carrier beads (capacity - 45pMoles of free amino groups per ml of beads) 
5 containing a 12-atom diaminodipropylamine spacer and a terminal amino-group (Pierce). 

Activated derivatives of fluorophore dyes: 6-(fluorescein-5-(and-6)-carboxamido)hexanoic acid, 
succinimidyl ester (5(6)-SFX), "mixed isomers"; Oregon Green 514 carboxylic acid, succinimidyl 
ester; Rhodamine Red-X, succinimidyl ester, "mixed isomers" and Texas Red-X, succinimidyl 
ester, "mixed isomers" were purchased from Molecular Probes. 

1 0 Covalent coupling of activated derivatives of fluorophore dyes to a polymer carrier was performed 
in accordance with both manufacturer's protocols, with some modifications. Briefly, 1ml of each 
dye derivative (5mg/ml in anhydrous dimethylformamide) was separately mixed with 15ml of 50% 
slurry of UltraLink Immobilized DADPA in 0.2M sodium bicarbonate buffer, pH 8.3 (the coupling 
yield was at least 95% and. consequently, four fluorophore dye carriers contained 0.7-1 pMoles of 

1 5 covalently bound dyes per ml of beads), unreacted amino groups of the carriers were extensively 
acetylated by an addition of 18-fold molar excess of acetic acid N-hydroxysuccinimide ester 
(Sigma) followed by hydroxylamine treatment of the fluorophore dye carriers In order to destroy 
any unstable intermolecular Intermediates of fluorophore dyes via their hydroxyl groups (for 
Fluorescein- and Oregon Green 514 carriers only). Finally, fluorophore dye carriers were 

20 quenched in 0.75M Tris-HCI, pH 8.7. loaded to the columns and extensively washed in high salt 
buffer, 50mM Tris-HCI, pH 7.5 + im NaCI followed by a storage buffer. 50mM TrIs-HCI. pH 7.5 + 
150mM NaCI (TBS) + 0.05% sodium azide. Fluorophore dye carriers were stored as 50% slumes 
at 4^0. 

The chemical structures of the dye conjugates are shown in Figure 1. The resulting total spacer 
25 between the dye molecule and polymer carrier was 19-atoms long for Texas Red-, Fluorescein-, 
Rhodamine Red-conjugates and 15-atoms long for Oregon Green 514-conjugate. This allows 
maximal reduction of steric hindrance for potential interactions with the large bacteriophage 
particles that bear the peptide libraries. After preparation we determined that the fluorophore dye 
carriers contained from 0.7-1 pMoles of covalently bound dye per ml of carrier beads. 

30 EXAMPLE 2 

Selection for peptide aota mers that bind small molecule fluorophores 
A phage display peptide library was screened that contained a combinatorial library of 12-mer 
peptides fused via a short glycine linker spacer (GGG) to the amino-terminus of a minor coat pill 
protein (5 copies per particle. Li et at. (1980) J. Biol. Chem. 255:10331-10337) of the filamentous 
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bacteriophage M13mp19 (Ph.D. phage display, New England Biolabs). During phage maturation, 
the leader secretory sequence is removed. This results in the 12-mer peptide positioned 
immediately at the amino-terminus of the mature protein. 



E. coli ER2537 strain [F' lacP A(lacZ)M15 proA^B* /fhuA2 supE thi A(lac-proAB) A(hsdSM-mcrB)5 
5 (rk* mk' mcrBC" )] and E.coli TG-1 strain [supE thi-1 A(lac-proAB) A(mcrB-hsdSM)5 (rk- mk* ) (F 
traD36 proAB lacl^ZAM15)] were purchased from New England Biolabs and Stratagene, 
respectively, and used as bacteriophage M13mp19 hosts. All bacteriological techniques were 
performed as described (Sambrook et al., supra; Ausubel ef a/., supra). 

Each biopanning round consisted of four sequential steps: (i) phage binding with a fluorophore dye 
10 carrier, (ii) washing unbound phage from the beads, (iii) nonspecific elution of bound phage and 
(iv) amplification of bound phage. The Ph.D.-12 phage display peptide library (based on modified 
M13mp19 bacteriophage, 1.9x10^ independent transformants) as a part of the Ph.D. -12 phage 
display peptide library kit was purchased from New England Biolabs. All procedures were carried 
out at room temperature unless noted. First, 0.15ml of centrifuged fluorophore dye carrier beads 
15 were blocked with 3ml of TBS + 2mg/ml BSA (blocking buffer) for 1,5hr with gentle rotating, beads 
were washed with 15ml TBS + 0.1% Tween-20 + 0.5mg/ml BSA (binding buffer), centrifuged. 
mixed with 1.8x10^^ plaque-forming units (pfu) of the Ph. D.-12 phage display peptide library ("95- 
fold library size) in 3ml of binding buffer and suspension was gently rotated for 4hr. For washing, 
bead suspensions were centrifuged and beads were transferred to the microcolumns and slowly 

2 0 washed' by 100-fold beads volume of TBS + 0.1% Tween-20. The bound phage were 

nonspecifically eluted by 1ml of 0.2M glycine-HCI. pH 2.2 + 1mg/ml BSA (elution buffer) for 10' 
and the eluates were immediately adjusted to a neutral pH by 0.15ml of 1M Tris-HCI. pH 9.1. 
Bound phage yields were determined by a titration of the eluates on the ER2537 host strain 
(Sambrook ef a/., supra). Amplification step: all bound phage (1.1m!) were added to 27 ml of 
25 1/100 diluted overnight ER2537 culture and amplified for 4.5hr at 37*^C with vigorous shaking. 
The amplified phage were precipitated from cell supernatants by 1/5 V of 20% PEG + 2.5M NaCI 
at 4'*C, re-precipitated again and, finally, suspended in 0.5ml TBS + 0.05% sodium azide. The 
resultant "50-fold concentrated amplified phage were stored at 4**C and their titers usually were in 
the range 0.5-1 .5x1 0^^ pfu/ml. For long-term storage amplified phage were adjusted to 50% 

3 0 glycerol and stored at -20**C. 

The amplified phage were used for the next biopanning rounds against the corresponding 
fluorophore dye carrier, if necessary, until apparent enrichment for binding was observed over 
background. We observed a significant increase in the amounts of bound phage after 3-4 rounds 
of biopanning (for all of the fluorophore dye carriers see Figure 2). This was compared to the 
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nonspecific phage background in the first and second biopanning rounds. Eluted phage from 
round 4 selected against four different fluorophore dye carriers are termed herein as TR-4, RhR-4, 
OG-4 and Flu-4 (standing for Texas Red. Rhodamine Red. Oregon Green 514 and Fluorescein 
carriers, respectively). 

5 The amplified phage from round 1 were further selected against the corresponding fluorophore 
dye carrier. The next biopanning rounds were performed in the same manner as round 1, except 
that: (i) input phage in the binding reaction were increased to 4.5x10^^ pfu, (ii) time of binding was 
reduced to 2hr in round 4, (iii) Tween-20 concentration in a washing buffer was increased to 0.2% 
in round 3 and 0.4% in round 4. (iv) eluted phage from last round 4 were not amplified, but rather 
10 served as a source of independent phage clones used for sequencing. 

The TR-4, RhR-4, OG-4 and FIu-4 phage sets were used as a source of independent phage 
clones that were sequenced for further analysis. Phage single-stranded DNAs (ss DMAs) and 
double-stranded DNAs (ds DNAs) were purified as previously described (Sambrook et al., supra), 
Fluorette-coding portions of the DNA and adjacent DNA regions were sequenced with -96 gill 
15 sequencing primer CCCTCATAGTTAGCGTAACG (New England Biolabs) (SEQ ID NO:1 15) using 
an Applied Biosystems 391 automated DNA sequencer. 

The sequences from the insert regions of the phage are grouped in Table 1. Two unique 
sequences were found that bound to the Texas Red conjugate beads, one for Rhodamine Red, 
seven for Oregon Green, and six for Fluorescein. Note that the Oregon Green set and the 
2 0 Fluorescein set shared two sequences that were identical at the nucleotide level as well as the 
amino acid sequence presented. These and other issues relevant to such observations are 
explored later. 

Phage selected against the Texas Red earner gave rise to the sequences KHVQYV\n"QMFYS and 
DFLQWKLARQKP at a 5:1 ratio (Table 1). Biopanning with the phage display library against the 

2 5 Rhodamine Red carrier gave rise to a single phage clone. RhR401 . carrying the amino acid 

sequence IPHPPMYWTRVF. Thus, TR-4 and RhR^ phage sets may be considered to be nearly 
"pure" phage clone populations by the fourth round of selection. 

We tested the binding specificity and excluded the possibility that the TR401 and RhR401 phage 
were selected against the polymer linker moiety rather than a fluorophore dye moiety of the dye 

3 0 carriers. The TR-4 and RhR-4 phage were cross-bound to Rhodamine Red and Texas Red 

carriers, respectively. Cross-binding of each phage to the inappropriate dye carrier did not exceed 
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the nonspecific phage background binding {data not shown). Thus the TR401 and RhR401 
ftuorettes are specific by this comparison for their respective conjugated dyes, despite the 
similarity of the compound core ring structures. Second, they had affinity for the corresponding 
conjugated fluorophore dyes rather than only a polymer linker moiety of Texas Red and 
Rhodamine Red carriers or other chemical features of the carriers themselves. 
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Four rounds of biopanning with the phage display peptide library against Oregon Green 514 and 
Fluorescein carriers were undertaken. Three clones were predominant in the sequenced 
population selected against Oregon Green (OG401, OG402, OG403) and four clones were 
represented once (Table 1). Similarly, when selected against Fluorescein three clones 
5 predominated and others were represented only once. Notably two of the three predominant 
Fluorescein-specific fluorettes had the same sequence as two of the predominant, independently 
selected fluorettes that had been found with the Oregon Green 514 carrier. These were OG402 
fluorette (YPNDFEWWEYYF) and the OG401 fluorette (ASDYWDWEWYYS). Since Oregon 
Green 514 is structurally related to Fluorescein, and is considered a Fluorescein pentafluoride 
1 0 (see Figure 1 ). independent selection of the same peptide fluorettes against Oregon Green 514 
and Fluorescein carriers was predicted. The same fluorettes selected independently against 
these fluorophores possibly bind to similar domains of the Oregon Green 514 and Fluorescein dye 
molecules. 

The selected phage had been originally selected as being capable of binding fluorophore dyes that 
1 5 had been covalently linked to a polymer carrier. The nature of carrier-crosslinked fluorophores. 
while useful for Initial selection of phage, is Inappropriate to study affinity of the phage except in 
relative terms. We therefore tested whether the selected phage can bind to the corresponding 
free fluorophore dyes In solution. Each bacteriophage particle contains 5 copies of plll-fluorette 
fusion protein. Therefore phage binding is governed by avidity considerations. We bound free 

2 0 fluorophore dyes to highly concentrated phage solutions and assayed for bound fluorophore dye 

after precipitation of the phage. 

Four activated derivatives of fluorophore dyes (see above) were quenched in 0.5M Tris-HCI, pH 

8.7 and used as "free" dyes. Ten pi\ of PEG-purifled and 300-fold concentrated phage TR401, 

12 

RhR401 , OG402, OG403 (1 .3-1 .8x10 phage particles; calculated from optical densities of the 
25 purified phage at 260 and 280nm) In TBS + 0.05% sodium azide buffer were separately mixed with 
an equal volume of 20//M solution of the corresponding free dyes In the same buffer; mixtures 
were adjusted by Tween-20 to 0.1 % and incubated for 3hr. Phage-fluorophore dye complexes 
were precipitated three times by PEG-8000 in order to remove unbound dye and dissolved in 6.6/il 
of TBS + 0.05% sodium azide buffer. Finally, phage-fluorophore dye complexes (VI of each) 

3 0 were spotted to nitrocellulose filter and filter was scanned on the Storm 840 scanner (Molecular 

Dynamics) in Blue Fluorescence/Chemifluorescence mode with 200^* pixel resolution. To 
determine nonspecific binding the normalized amounts of phage particles of amplified, unselected. 
and PEG-purlfied Ph.D,-12 phage display peptide library were separately Incubated with the 
corresponding free dyes and treated in the same conditions. All binding experiments were 
3 5 accomplished in duplicate. Specific/nonspecific signal ratios were quantified by densitometry of 
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spot images using "Measure" option of NIH Image 1.59 freeware package. 

We found that OG403. OG402. RhR401 and TR401 phage specifically bound their respective 
fluorophore dyes (see Figure 3). This result shows that phage selected against covalently bound 
fluorophore dye were still capable of interacting with their cognate free fluorophore dye in solution. 

EXAMPLE 3 

Forced evolution of hioher affinity fluorophore-bindino peptides 
The total number of possible random 12-mer peptides is equal to 20^^=4. 1x10^^ The complexity 
of the phage display peptide library used for biopanning against the fluorophore dye carriers was 
much smaller, containing only 1.9x1 Cyclones. Thus, the library represents a fraction of all possible 
12-mer peptides that could have been searched for binding. We therefore sought to improve the 
present fluorettes. We chose to introduce mutations into the fluorette peptide sequences at the 
DNA level in the phage and then select for phage that displayed peptides with higher affinity 
(compared to the parent) against the respective fluorophore dye carrier. We chose the Texas 
Red- and Rhodamine Red-specific clones (TR401 and RhR401, respectively), as well as the 
predominant Oregon Green 514- and Fluorescein-specific fluorette clones (OG403 and OG402, 
respectively, see above), for this forced evolution based on initial indications of their affinities (data 
not shown). We set up oligonucleotide synthesis of the corresponding fiuorette-coded DNA 
sequences in such a manner that the mutagenesis rate was 9% for every 36 nucleotide positions 
of the peptide fluorette. Nucleotides A and C were omitted from the third position in each codon to 
improve the relative representation of all amino acids and by limits stop codon generation. 

Four minus-strand oligonucleotides containing degenerated fluorette-coding sequences were 
synthesized (Protein & Nucleic Acids Facility. Stanford University Medical Center). 

TR401-91CL: 

CTCCCCTTCGGCCGAACCTCCACCAGAATAAAACATCTGCGTCCAATACTGCACATGCTTA 
2 5 GAGTGAGAATAGAAAGGTACCACTCTCCC (SEQ ID NO:88); 
RhR401-91GL: 

GTCCCCTTCGGCCGAACCTCCACCAAACACACGAGTCCAATACATAGGAGGATGCGGAAT 

AGAGTGAGAATAGAAAGGTACCACTCTCCC (SEQ ID NO:89): 

OG402-91CL: 

3 0 CTCCCCTTCGGCCGAACCTCCACCAAAATAATACTCCCACCACTCAAAATCATrCGGATAA 
GAGTGAGAATAGAAAGGTACCACTCTCCC (SEQ ID NO:90): 
OG403-91CL: 
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CTCCCCTTCGGCCGAACCTCCACCCCAAGCAGTCCAATCCCAATAATAATCCCACCCATGA 
GAGTGAGAATAGAAAGGTACCACTCTCCC (SEQ ID Nd:91 ). 

Nucleotides in a regular case shows no degeneracy; boldfaced nucleotide designates 91% of 
shown nucleotide and 3x3% of each of the other three nucleotides; boldfaced and underlined 
5 nucleotide designates 91 % of A or C and 9% of C or A. respectively. 

Degenerated minus-strand oligonucleotides were separately annealed to the plus-strand 
oligonucleotide GGGAGAGTGGTACCTTTCTATTCTCAC (SEQ ID NO:92), partial duplexes were 
filled by T4 DNA Polymerase (Ausubel etaL, supra), cut by Kpnl and EagI and, finally, 1.9 ng of 
double digested filled duplexes was ligated to 200ng of low-melting point agarose gel-purified large 

10 fragment of the replicative form (RF) Ph.D. 121 DNA cut with the same pair of enzymes (the 

Ph.D.121 was a randomly picked phage clone from the Ph.D.-12 phage display peptide library) in 
a total volume 0.1 ml. Ligations were electroporated into 0.5ml of TG-1 electrocompetent cells 
(Stratagene) according to manufacturer's protocol and phage library complexities were determined 
by immediate mixing of several 10-fold dilutions of transfected TG-1 cells with ER2537 cells and 

15 plating. Phage libraries were further amplified in 400ml of liquid LB media for 4hr at S^^C with 
vigorous shaking, concentrated by PEG (see above) and. finally, suspended in 0.5ml TBS + 
0.05% sodium azide buffer. 

Titers of four resultant nonrandom combinatorial phage libraries were 0.2-4.0x10^^ pfu/ml. For a 
long-term storage phage libraries were adjusted to 50% glycerol and stored at -20**C. Twelve 
20 Independent phage clones from each TR401-91CL and RhR401-91CL libraries were sequenced in 
order to determine the average level of amino acid substitutions in the fluorette moiety. 

We determined the nucleotide sequences of twelve independent phage clones from each of the 
TR401-91CL and RhR401-91CL libraries in order to determine the average level of amino acid 
substitutions in the displayed fluorettes. These were determined to be 2,30 and 2.45. respectively. 

2 5 These experimental values correlated well to the expected theoretical values (calculated from a 

degeneration frequency 0.09 on the nucleotide level, see above). The nucleotide mutation rate 
from the other two libraries, OG403-91CL and OG402-91CL was not determined but likely had a 
comparable average level of amino acid substitutions In the fluorette moiety as all four 
corresponding oligonucleotides were synthesized in parallel using the same batch of pre-mixed 

3 0 monomer nucleotides. 

A key difference of the secondary blopanning versus the primary biopanning with the original 
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phage library is the strict need to maximize selection against the originating, parental fluorette. 
For this purpose we significantly reduced concentrations of the fluorophore dye carrier. We also 
reduced the phage concentrations present during the binding steps. In addition, the binding time 
was reduced while the amounts of washing buffer per ml of carrier beads were increased. 
5 Specifically, (i) input phage in the binding reaction were decreased to 1.0x10^° pfu, (ii) time of 
binding was reduced to 10', (iii) amount of fluorophore dye carrier beads in the binding reaction 
was reduced to 0.03ml, (iv) following binding to the phage beads were washed with 330-fold 
beads volume of washing buffer and (v) Tween-20 concentration in the washing buffer was 
increased to 0.4%. 

10 An increase In approximately 1.2-1 .5 logs of magnitude in the total number of bound phage after 
2-3 rounds of biopanning (compared to that of the first round) for all the fluorophore dye carriers 
was seen (see Figure 4). Eluted phage from round 3 were termed TRS-3. RhRS-3. OGS-3 and 
FluS-3, These phage represent a second generation of phage clones selected against the four 
fluorophore dyes. Texas Red. Rhodamine Red. Oregon Green 514 and Fluorescein, respectively. 

15 We isolated and sequenced a panel of Independent phage clones from each of the TRS-3. RhRS- 
3, OGS-3 and FluS-3 phage sets to determine the sequences of the mutants (Table 2). In the 
Texas Red selection half of those sequenced encoded the parental amino acid structure. This 
indicated successful, though partial, selection against the parental binding characteristics under 
these conditions. Four of the sixteen TRS401 progeny cloned carried a conservative S12T 

2 0 substitution. Each of these substitutions occurred independently as It can be observed they are 
associated with additional, independent mutations. This suggests that the S12T substitution Is 
probably an important change that enhances the affinity of the phage for the dye. Three progeny 
Texas Red fluorettes showed substitution at the second amino acid of histidine to P (found twice) 
or N. It is interesting to note that clone TRS31 1 carried both the H2P and the S12T mutations. 

2 5 Fewer mutant clones, only two, were observed for Rhodamine Red. The RhR401 phage clone 

might represent a local optima in affinity for Rhodamine Red. We found numerous in-frame 
deletions in the RhR401 fluorette-coded DNA sequences (about 33% of independent clones 
carrying large and small In-frame deletions in the fluorette moiety). This is an indicator of potential 
toxicity of the substituted mutant RhR401 fluorette sequences for bacteriophage growth and 

3 0 survival (data not shown). For Oregon Green 514 multiple different peptide sequences were 

selected, with only two independently containing the same, non-conservative, mutation (G2E). For 
Fluorescein the majority of peptide fluorettes selected were also not parental sequences. Several 
Independent mutations were Independently selected at fluorette positions 3 (N to S found twice). 5 
(conservative change of F to Y found three times). 6 (conservative change E to D found twice). 9 
3 5 (conservative E to D found twice), and 12 (conservative F to Y found 4 times). Also, certain 
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Table 2. Fluorophore dye-specific phage clones and peptide fluorette sequences. The 
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phage clones were isolated after three biopanning rounds with TR401-91CL, RhR401- 
91 CL, OG403-91CL and OG402-91CL nonrandom combinatorial phage display peptide 
libraries (see text), 

amino acid substitutions in mutant peptide fluorette vs the corresponding parent are 
shown, mutant peptide fluorette sequence is shown as many times as it was found (see 
phage clone frequency); dots designate the same amino acid as that in the 
corresponding parent; dash in a second position of the FluS304 fluorette designates in- 
frame deletion, 

original parental clones and the parental clones carrying silent mutation(s) in peptide 
fluorette moiety. 
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positions appeared favored for change, i.e., 2. 3, and 12 with no strong bias for the substituted 
residue. 

These mutations may be important for increasing the overall affinity of the peptide fluorette- 
fluorophore interaction. We tested one of each of the phage from the secondary screens for 
5 increases in avidity (Table 3). PEG-purified TR401 phage, TRS31 1 phage or the amplified Ph.D.- 
12 phage display peptide library phage as a negative control (input 1. 5x1 0®-1. 1x10^° pfu with 2 to 
5-fold increments) were incubated with 23.5pl of BSA-blocked and washed Texas Red carrier 
beads in binding buffer (see above) in total volume 40pl for 3hr. Beads suspensions were 
centrifuged, the supernatants (unbound phage) were titrated and dissociation constants were 
10 measured via a standard linear Scatchard plot. For TR401 and TRS31 1 phage bound/unbound 
ratios were quite reliable (6.3-13.1 in the range of phage concentrations shown above). All binding 
experiments were performed in duplicate and titrations were performed in triplicate. This direct 
binding assay can reliably measure if it does not exceed "1 .5-2nM. For higher K^, 
bound/unbound ratios became unreliable (=1). 

15 Other PEG-purified phage (input 5x10^-2.5x10^*^ pfu with 2 to 5-fold increments) were bound to the 
respective fluorophore dye carriers essentially in the same manner as described above for Texas 
Red-specific phage. RhR401 and RhRS308. OG403 and OGS316, OG402 and FluS303 phage, 
were bound to the Rhodamine Red. Oregon Green 514 or Fluorescein carriers, respectively. 
Following binding the beads were quickly washed twice by 0.25ml of TBS + 0.1% Tween-20 and 

2 0 suspended in 1 0ml TBS. 1 0pl of suspension was mixed with "2x1 0® log-phase ER2537 cells and 

incubated 1hr at 4*^0 with slight shaking to allow phage adsorption. Several ten-fold dilutions of 
infected cells were mixed with noninfected ER2537 cells and plated in standard plaque assay 
(Sambrook ef a/., supra). All binding experiments and titrations were accomplished in duplicate. 
Nonspecific background of binding (determined with amplified Ph.D.-12 phage display peptide 
25 library phage) did not exceed more than 3% of specific binding in each case. For all above phage 
pfu/particle ratio was in the range 0.4-0.5. 

The Texas Red progeny phage TRS31 1, which contained the apparently important S12T 
substitution, had a threefold increased avidity as compared to the TR401 parent. The affinity of 
the double mutant clone RhRS308 versus the parent was not even marginally improved (Table 3). 

3 0 For Oregon Green 514 the phage clone OGS316, which carried the G2E substitution, had a 2.7- 

fold higher avidity than its parent. The multiply substituted clone FluS303. which contained two 
putatively important changes, had a 6.5 fold increase in relative avidity. Thus, peptides can be 
matured to higher avidity. This likely is due to corresponding affinity increases for the individual 
peptide fluorette against its respective dye. 
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EXAMPLE 4 
Binding of peptides to fluorophores in solution 
To identify peptide fluorettes that bind fluorophore dye independent of the full context of the pill 
fusion bacteriophage protein we used synthetic peptides corresponding to the sequence of the 
5 selected fluorette region. We added to the peptide eight amino acids derived from the adjacent 
sequence of the pill protein. This would accommodate any partial contributions of context from 
the pill protein sequence. These were followed by a short GGG spacer and Hisg tag (see Table 
4). We synthesized peptides from the original library screening that had been capable in solution 
of specifically binding each of the three dyes (Texas Red, Oregon Green 514. Fluorescein). The 

10 Texas Red peptides chosen were TR401 and Its higher avidity progeny TRS31 1 ; a peptide 
corresponding to the unrelated Texas Red primary clone TR406 was also synthesized. The 
Oregon Green 514 clones OG401 and OG403 were synthesized to test, in part, the observation 
that both contained a common motif YWDW which might represent a common binding motif for 
Oregon Green 514. Note that the OG401 clone also bound to Fluorescein In the original 

15 screening. The other clone which bound to both Oregon Green and Fluorescein In the primary 
screen. OG402. was synthesized. The peptide derived from a secondary screen against 
Fluorescein. FluS303, which demonstrated a 6.5-fold higher affinity was also synthesized. No 
peptides for Rhodamine Red were synthesized. 

Peptides were bound to cobalt ion-coated Sepharose beads and the resultant peptide-coated 
20 beads were incubated with corresponding fluorophore dyes, washed, and visualized for dye 

binding. The same amounts of Texas Red-specific peptides PepTR401, PepTRS311. PepTR406 
as well as nonspecific peptide PepControl (see Table 4) were bound via Hisg tag to the TALON 
Metal Affinity Resin (Clontech) in TBS buffer essentially as described by manufacturer's protocol. 
Beads (5/il) were washed twice with 0.5ml TBS to remove unbound peptide and incubated with 

2 5 0.5mM Texas Red in 40^1 of TBS buffer for 1 hr at RT. Finally, the beads were washed three times 

with 0,3ml TBS to remove unbound dye. Fluorescent and nonfluorescent control beads were 
photographed with 10-fold magnification on fluorescent microscope Axiophot (Zeiss) using 
Rhodamine Red/Texas Red filter with 5" exposure. 

Most peptides failed to bind dyes to any detectable degree. However. Texas Red peptides 

3 0 PepTR401 and PepTRS31 1 showed significant binding to Texas Red while they bound to neither 

Oregon Green 514 nor Fluorescein (data not shown). PepTR406 did not bind to Texas Red. 
however. Lack of dye binding might be due to low peptide affinity for cognate dyes and/or 
because the binding requires contributions from the pill fusion protein not present in these 
peptides. 
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The above test only checks for bound dye that remains fluorescent at the wavelengths tested. 
Binding of fluorophore dyes to specific peptides might result in changes in the fluorescence 
spectra of bound dye versus the spectra of free dye. Or, one might observe an increase or 
quenching of fluorescence. We mixed concentrated peptide with 50nM of the corresponding dye in 
5 solution and determined the excitation and emission spectra of the resulting mixture. No 

detectable changes in peak excitation nor emission were observed for peptides selected against 
Oregon Green 514 or Fluorescein (data not shown), nor for PepTR406, nor for a control peptide 
(Figure 5B). 



Table 3. Phage - fluorophore dye binding affinity. Dissociation constants. 
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However, the peak positions of the excitation and emission spectra of soluble Texas Red bound to 
lOpM PepTR401 was shifted relative to Texas Red alone or as compared to a mixture of Texas 
Red and a nonspecific control peptide (Figure 5A). A +1.9 nm peak excitation shift with PepTR401 
was observed. The peptide was checked against Fluorescein as a nonspecific control and 
showed no shift in the Fluorescein excitation or emission spectra. An approximate peak excitation 
shift of +2.8 nm was effected upon dye binding with PepTRS31 1 . Interestingly, although 
PepTR401 had a shift in its peak excitation, no emission shift change was elicited upon binding 
fluorophore. However, PepTRS311 did manifest a significant +1.4nM peak emission shift in its 
spectra. We checked all other excitation and emission wavelengths and the changes observed 
correspond to a largely global shift in the spectra. No other significant changes in local optima 
were observed (data not shown). 

Interestingly, the TR401 peptide is the progenitor of the TRS31 1 peptide. Thus, the two 
substitutions H2P and S12T result not only in an increased binding, but also in an apparent 
differences in the excitation spectra elicited. Moreover, though both show a shift in the excitation 
15 only the selected, higher affinity. PepTRS31 1 had a peak emission spectra shift. Thus, one or 
both of the two substitutions, H2P or S12T. are critical for shifting the spectra of the emission 
profile. 

We attempted with these findings to obtain an affinity measurement for the higher affinity TRS31 1 
peptide. Peptides in different concentrations (1.3-20mM) were incubated with Texas Red (50nM) 
2 0 in 0.6ml of TBS buffer for 1 hr at RT. Excitation and emission spectra of samples were obtained by 
using spectrofluorimeter SPEX Fluoromax (Jobin Yvon-SPEX Instruments Co.) and DataMax 
software package. Peak positions were automatically calculated by software. List mode data was 
saved for other analyses. 

Reduction of the concentration of PepTRS311 from 10pM to 1.3pM reduced slightly, but not 
completely, the change in peak excitation position (Figure 5A). The narrow dynamic range of the 
shift and the broad emission spectra provided for a limited ability to obtain an approximate 
measure of the affinity. Affinity measurements calculated roughly from these results place the 
affinity of the peptide for the fluorophore at 0.1-0.5|jM. Further examination of this issue by other 
approaches is required to obtain more accurate reflections of the affinity and off-rates. However, 
together with the agarose bead-binding data we can conclude that we have successfully selected 
for peptides with sufficient affinity to bind small molecule fluorophore dyes in solution. In addition, 
the binding is specific and the spectral qualities of the dyes can be modulated dependent upon 
modifications of the bound peptide. 
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Table 4. Fluorette-carrying peptides. 



Fluorophore dye 


Peptide^ 


Sequence^ 


Texas Red 


Pep. TR401 


KHVOYWTOMFYSGGGSAETVGGGHHHHHH 


Pep. TRS311 


KPVOYWTOMFYTGGGSAETVGGGHHHHHH 


Pep. TR4 0 6 


DFLOWKLAROKPGGGSAETVGGGHHHHHH 


Oregon Green 514 


Pep. OG4 03 


HGWDYYWDWTAWGGGSAETVGGGHHHHHH 


Pep. OG401*= 


ASDYWDWEWYYSGGGSAETYGGGHHHHHH 


Pep. OG402^ 


YPNDFEWWEYYFGGGSAETVGGGHHHHHH 


Fluorescein 


Pep. FluS303 


YPNEFDWWDYYYGGGSAETVGGGHHHHHH 




Pep. Control** 


ASGSGASGSAGSGGGSAETVGGGHHHHHH 


a) peptide names reflect the names of phage 


clones carrying respective fluorettes except 



Pep. Control (see Tables 1 and 2) 

b) first twenty amino acids of the peptides are the same as in the pill fusion protein of the 
corresponding phage clones except Pep. Control (see Tables 1 and 2 and also New 
England Biolabs "Ph.D.-12 Phage Display Peptide Library Kit" manual), thus, amino 
acids 1-12 are the fluorette potion (double underlined) and amino acids 13-20 are the 
distal pill fusion part (underlined). Amino acids 21-29 contain a small GGG spacer 
followed by HISq tag (shown in regular case). (SEQ ID NOS:41-48) 

c) specific for both Oregon Green 514 and Fluorescein (see Table 1). 

d) nonspecific control peptide synthesized for specificity check. 
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Example 5 
Constrained Texas Red-Binding Peptides 
Constrained Texas Red-binding peptides were identified via five rounds of biopanning of a mixture 
of two constrained phage display libraries against a polymer carrier with covalently bound Texas 
5 Red. Two libraries contained SKVILFE-flanked nine or thirteen amino acid variable region in the 
N-terminal part of M13 bacteriophage pill protein. The structure of the libraries were as follows: 

GGGSKVILFEGPAG (X)9ori3 GAPGSKVILFEGGPG (SEQ ID NO:93)- (pill protein) 

SKVILFE-dimerizers are underlined. Flexible linkers or spacers GGG, GPAG ( SEQ.ID NO:g4), 
GAPG (SEQ.ID NO:95) and GGPG (SEQ.ID NO:96) are double underlined. X represents any 
10 amino acid. 

The flexible linker or spacer GPAG is encoded by a nucleotide sequence containing Fsel 
restriction endonuclease site. The flexible linker or spacer GAPG is encoded by a nucleotide 
sequence containing AscI restriction endonuclease site. Both Fsel and AscI restriction enzymes 
are rare eight-cutters. Thus, upon a double enzyme digestion, a universal cassette-library or 
15 cassettes containing specific sequences can be conveniently exchanged between different 
vector/host systems or different dimerizers of a choice (e.g., pair of cysteines, coiled coil 
structures, etc.). 

Biopanning against carrier with Texas Red ultimately revealed at least seven different Texas 
Red-binding constrained peptides (both 9- and 13-mers in a variable part): 

2 0 GGGSKVILFEGPAG RTIWEPKEASNHT GAPGSKVILFEGGPG 

GGGSKVILFEGPAG WSKMGHTVT GAPGSKVILFEGGPG 

GGGSKVILFEGPAG RWTWEPISE GAPGSKVILFEGGPG 

25 

GGGSKVILFEGPAG GNQKCLQHNRCST GAPGSKVILFEGGPG 
GGGSKVILFEGPAG SQTWSFPEH GAPGSKVILFEGGPG 



(TRP501) 
SEQ.ID NO:97 

(TRP505) 
SEQ.ID NO:98. 

(TRP512) 
SEQ.ID NO:99 

(TRP518) 
SEQ.ID NO:100 

(TRP526) 
SEQ.ID NO:101 
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GGGSKVILFEGPAG EPMARPWERKQDR GAPGSKVILFEGGPG 



(TRP527) 
SEQ.IDNO:102 



GGGSKVILFEGPAG GTLSATRPYGRQW GAPGSKVILFEGGPG (TRP541 ) 

SEQ.ID NO:103 

5 Consensus motife can be observed within these peptides (e.g., RXXWEP (SEQ ID NO: 104), WEP 
and TW; see Table 5) that suggests structural features common to the peptides that allow for 
efficient binding. Interestingly, no said constrained peptide has a significant homology with linear 
Texas Red-binding peptides. 

Binding affinity of phage clones revealed the most avid binders. TRP512 phage and TRP501 
10 phage, with equal to 25 pM and 80 pM. respectively. Other five phage clones were much less 
avid with K^ ^ 5 nM (see Table 6). 

Binding of synthetic peptides (nonspecific peptide. TR401. TRP501 and TRP512) to Texas Red is 
shown in Figure 6B-C. 

Binding affinity of synthetic TRP501 peptide is measured as K^^ 200 nM (=8-fold more avid than 
15 linear TR401 and TRS31 1 peptides). Binding affinity of synthetic TRP512 peptide is measured as 
Kd«10 nM 160-fold more avid than linear TR401 and TRS311 peptides). 

CONCLUDING REMARKS 
The foregoing description details specific methods which can be employed to practice 
the present invention. Having detailed such specific methods, those skilled in the art will well 
20 enough know how to devise alternative reliable methods at arriving at the same information in 

using the fruits of the present invention. Thus, however detailed the foregoing may appear in text, 
it should not be construed as limiting the overall scope thereof; rather, the ambit of the present 
invention is to be determined only by the lawful construction of the appended claims. All 
documents cited herein are hereby expressly incorporated by reference. 
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